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Abstract 

Consider the partial sums {St} of a real-valued functional F{^{t)) of a Markov chain {$(i)} with 
values in a general state space. Assuming only that the Markov chain is geometrically ergodic and 
that the functional F is bounded, the following conclusions are obtained: 

Spectral theory: Well-behaved solutions / can be constructed for the "multiplicative Poisson 
equation" (e"^ P)f — A/, where P is the transition kernel of the Markov chain, and a G C 
is a constant. The function / is an eigenfunction, with corresponding eigenvalue A, for the 
kernel (e"-^P) = e°'^^^^ P{x,dy). 

A "multiplicative" mean ergodic theorem: For all complex a in a neighborhood of the origin, the 
normalized mean of exp(Q;S't) (and not the logarithm of the mean) converges to / exponentially 
fast, where / is a solution of the multiplicative Poisson equation. 

Edgeworth Expansions: Rates are obtained for the convergence of the distribution function of 
the normalized partial sums St to the standard Gaussian distribution. The first term in this 
expansion is of order and it depends on the initial condition of the Markov chain 

through the solution F of the associated Poisson equation (and not the solution / of the 
multiplicative Poisson equation) . 

Large Deviations: The partial sums are shown to satisfy a large deviations principle in a neigh- 
borhood of the mean. This result, proved under geometric ergodicity alone, cannot in general 
be extended to the whole real line. 

Exact Large Deviations Asymptotics: Rates of convergence are obtained for the large deviations 
estimates above. The polynomial pre-exponent is of order (l/\/i), and its coefficient depends 
on the initial condition of the Markov chain through the solution / of the multiplicative 
Poisson equation. 

Extensions of these results to continuous-time Markov processes are also given. 
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1 Introduction 



Consider a Markov process ^ = {^(t) : t € T} taking values in a general state space X, and with time 
being either continuous, T = [0, oo), or discrete, T = {0, 1, . . .}. Let F : X -^■ M be a given functional 
on the state space of 

Our interest lies in the long-term behavior of 

St= f Fms))ds, teJ, (1) 

J[0,t) 

where in discrete-time the integral is a sum, and St are simply the partial sums 

n— 1 

Sn = Y.F{^{i)), n>l. (2) 

i=0 

1.1 Multiplicative Ergodic Theory 

For simplicity we first discuss the case of a discrete-time Markov chain $ with a countable state space 
X. If * is positive recurrent with invariant probability measure tt, then for any F with finite mean 

af) = j:.<oc)f{x), 

^E^[Sn] ^ 7r(F), n^oo, (3) 

where x = $(0) is the initial condition, Sn are the partial sums defined above, Px is the law of # 
conditional on $(0) = x, and is the corresponding expectation. 

Often we can quantify the rate of convergence in (3) by showing that the following limit exists, 

F{x) = hm Ex[Sn-n7r{F)], (4) 

n— >oo 

where, in fact, the function F solves the Poisson equation: 

PF = F-F + tt{F). (5) 

Here P denotes the transition kernel of P{x, y) :=Pr{$(l) = y ] $(0) = .x}, and P acts on functions 
/ : X — > M via Pf{x) = -P(a;, Results of this kind hold for a wide class of Markov chains 

on a general state space, as shown in [37] in discrete-time and in [36, 38] in continuous-time. 

In this paper we seek multiplicative versions of the ergodic results in (3)-(5). Let a G C, and 
consider the product 

n-l 

J] exp(aF($(z))) = exp(a5„). 

i=0 

For countable state space chains in discrete-time, multiplicative results corresponding to the ergodic 
theorems (3)-(5) were established in [2] when a is a real number. The mean ergodic theorem (3) 
corresponds to the multiplicative limit 

i log E^[exp(a5n)] ^ A(q;), n ^ oo, (6) 

for some analytic function A{a) G M, and the stronger limit theorem (4) has the multiplicative coun- 
terpart 

/^(x) = lim Ea.[exp(aS'„ - nA(a))], (7) 
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where fa solves the natural analog of (5), the multiplicative Poisson equation: 

PU = e^p(-aF + A{a))fa. (8) 

Our first aim is to provide natural conditions under which the multiplicative ergodic results (6)- 
(8) hold. As we indicate in several instances, our conditions (and the results obtained under them) 
are often optimal or near-optimal (see Proposition 6.2, and the examples in Section 7). Equipped 
with these results, we go on to prove precise expansions for some classical probabilistic limit theorems 
satisfied by the partial sums Sn- Specifically, the multiplicative mean ergodic theorem (7) leads to 
Edgeworth expansions for the central limit theorem and to exact large deviations asymptotics. 

There are numerous approaches to multiplicative ergodic theory and its related spectral theory in 
the literature; a brief survey is given at the end of this introduction. The conditions given in this 
paper considerably extend known criteria for the existence of solutions to the multiplicative Poisson 
equation and for the validity of the multiplicative mean ergodic theorem. 

Most closely related to the approach taken here are the results of [2], developed for discrete-time 
Markov chains $ on a discrete state space along the following lines. For any real a, define the new 
kernel Pa by 

Pa{x, y) = exp(aF(rE))P(x, y), x, y G X, (9) 

where P{x,y) is the transition kernel of the Markov chain In this notation, the multiplicative 
Poisson equation (8) can be rewritten as 

Pafa = ^afaj (10) 

with Xa = cxp(A(a)). That is, the solutions fa of the multiplicative Poisson equation (8) arc eigen- 
functions for the new kernel Pa, with associated eigenvalues A^. [Throughout the paper, we try to 
maintain the convention that lower-case letters denote quantities that are exponential versions of the 
corresponding upper-case letters; e.g., A = exp(A)]. 

Under a monotonicity assumption on F, it is shown in [2] that well-behaved eigenfunctions for 
(10) exist for real a in a neighborhood of zero. Based on such an eigenfunction fa with corresponding 
eigenvalue Xa, the twisted kernel Pa is defined, 

Pa{x,y)=X-'f-\x)Paix,y)faiy), (11) 

and the convergence in (7) is deduced from the properties of Pa- 

For bounded functionals F, and assuming only that $ is "geometrically ergodic," results corre- 
sponding to (6)-(8) are obtained in Section 4 of the present paper, for Markov processes $ on a general 
state space, in continuous- or discrete-time, and for complex a. For our purposes, a Markov chain * 
is geometrically ergodic if it is ^-irreducible, aperiodic, and a Lyapunov function F : X — [1, oo] exists 
such that the following condition holds: 



For a "small" set C C X, and constants 5 > 0, b < oo: 
PV <{1- d)V + bic . 



(V4) 



Precise definitions and a more general version of condition (V4) for Markov processes in discrete- or 
continuous-time are given in Section 2.2. 

Geometric ergodicity for $ is our main assumption, and it will remain in effect throughout the 
paper. Section 1.3 offers a discussion comparing (V4) to several of the standard assumptions in the 
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relevant literature, and in Section 7 geometric ergodicity is verified for several classes of important 
examples. Note also that what we call geometric ergodicity here is equivalent to the notion of geometric 
ergodicity used in [37], where it is stated slightly differently. 

In the following section we briefly describe the probabilistic implications of the spectral theory 
outlined above. Along a different direction, in [26] we extend our present results to the case of 
products of random matrices. This extension leads to an interesting and non-trivial application of the 
present ideas to a stability question arising from systems theory. 

1.2 Probabilistic Limit Theorems 

The multiplicative mean ergodic theorems in (6) and (7) offer precise information about the asymptotic 
behavior, as n — > oo, of 

mn{a) := E^[exp{aSn)], a G C . 

When a = iu is imaginary, mn{oe) is simply the characteristic function of the partial sums Sn, and it is 
well-known that information about the convergence of the characteristic functions leads to Edgeworth 
expansions related to the central limit theorem [17, 24, 46]. Similarly, when a is real, is the 

moment-generating function of the partial sums Sn, and the precise convergence of the corresponding 
log-moment generating functions to a smooth limiting A(a) as in (6) leads to exact large deviations 
asymptotics; see [13, 7]. 

Suppose $ is a geometrically ergodic Markov chain, and let F be a bounded, non-lattice, real- 
valued functional on the state space of In Section 5, we obtain an Edgeworth expansion for the 
distribution function Gn{y) of the normalized partial sums [Sn — wk^F)]/ ay/n, 

where is the asymptotic variance of Sn/\/n. In Theorem 5.1 we show that, for all x G X, 

+ o{n~^^'^), n — > oo, 

uniformly in y G M, where 7(y) denotes the standard Normal density, Q{y) is the corresponding 
distribution function, F is the solution to the Poisson equation (5) given in (4), and pz is a constant 
related to the third moment of Sn/ \/n. 

A similar expansion is obtained in the case of lattice functionals F. These results generalize the 
Edgeworth expansions in [41, 30, 8], where they are derived under much more restrictive assumptions. 
In particular, in all these papers the conditions given are stronger than Doeblin recurrence, which is 
significantly stronger than the form of geometric ergodicity assumed in this paper — see the discussions 
in Section 1.3 and Section 7. 

In Section 6 we discuss moderate and large deviations for the partial sums Sn- Under geometric 
ergodicity, the multiplicative mean ergodic theorem (7) implies that a moderate deviations principle 
(MDP) holds for the partial sums Sn- Note that geometric ergodicity is essentially equivalent to the 
weakest conditions known to suffice for the MDP [10, 11] (although weaker assumptions can be used 
to obtain the MDP lower bound). 

By standard large deviations techniques [13], the convergence of the log-moment generating func- 
tions in (6) to a smooth limiting A(a) can be used to prove large deviations estimates for the partial 
sums Sn'- Suppose $ is a Doeblin chain, and let F be a bounded, real- valued functional on the state 
space of In Proposition 6.2 we show that under the stationary distribution tt of the partial 




Gn{y) = Q{y) + 



(1 - y^) - F{x) 
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sums Sn satisfy a large deviations principle (LDP) in a neighborhood of the mean vr(F), i.e., for any 
c > tt{F) close enough to the mean 7r(F), 

^logP,{5n>nc} ^ -A*(c), n^oo, (12) 

where A*(c) is the Fenchel-Legendre transform of A(-). (A corresponding result holds for the lower 
tail.) 

Note that this result cannot in general be extended to a full LDP on the whole real line. For 
example, Bryc and Dembo [5] have shown that the full LDP may even fail for the partial sums of a 
Doeblin chain with a countable state space. 

Further, the more precise convergence result (7) leads to exact large deviations expansions analo- 
gous to those obtained by Bahadur and Rao [1] for independent random variables: For geometrically 
ergodic chains and non-lattice functional F, in Theorem 6.3 we obtain the following: For any c > 7r(F) 
close enough to the mean 7r(F), and all x eX, 

PASn>nc} |(^e-nA*(c)^ ^^^^ (13) 

where a G M is chosen such that A' (a) = c, fa (x) is the solution to the multiplicative Poisson equation 
(10), A*(-) is as in (12), and = A"(a). A corresponding expansion is given for lattice functionals. 

These results generalize those obtained by Miller [39] for finite-state chains, and those in [30], 
proved under conditions stronger than Doeblin recurrence (in [30] a version of the domination as- 
sumption in (15) below is assumed, together with additional regularity conditions). 

The problem of obtaining exact large deviations asymptotics (such as in (13) above) has been 
considered by [28, 42], using a "pinned" multiplicative mean ergodic theorem for a ■0-irreducible and 
aperiodic Markov chain. It is shown that for a "small" set C C X, 

hm i log E^[exp(a5„)I($(n) G C)] = A(a), (14) 

and from this, under additional conditions (assuming a variant of the "uniform domination" condition 
(15) discussed in the following section), large deviations expansions are proved along the same lines 
as indicated above. The difference here is that, because of the additional constraint imposed by the 
small set C in (14), the resulting expansions are not for the probabilities Px{Sn > nc} as in (13), but 
for the "pinned" probabilities Px{Sn > nc and $(n) G C}. 

Finally note that in much of the relevant literature authors often consider a Markov additive 
process model instead of simply the partial sums of a given Markov processes. For simplicity (and 
without loss of generality), we restrict our attention to the asymptotic behavior of the partial sums 
themselves. 

1.3 Related Approaches 

In this paper we attempt to place within a single framework results from two previously disparate 
research areas: The theory of positive operators as developed in [45, 44], where Tq, = (A„)^^ is the 
convergence parameter for the semigroup generated by Pa, and from the theory of positive harmonic 
functions for diffusions where A(a) = log(Aa) is known as the generalized principal eigenvalue [47]. The 
reason that the constant is given two different names is that, so far, the discrete-time theory of '0- 
irreduciblc Markov chains and the related continuous-time theory of positive harmonic functions have 
been developed independently. Looked at together, many of the results of the latter continuous-time 
theory can be replicated, improved, or generalized by lifting results from the discrete-time setting. 
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These and some other relevant approaches in the existing hterature are summarized below. As 
this literature is very extensive, the following discussion is not intended to be a complete review. 

A. i/'-irreducible operators. The most general approach to understanding the eigenfunction equa- 
tion (10) has been developed for discrete-time Markov chains, based on renewal theory and the 
theory of positive, V'-irreducible operators; see Nummelin's monograph [45]. In this framework 
A(q;) = — log(ra), where is the convergence parameter for the semigroup generated by the ker- 
nel Pa defined in (9). 

Although, in general, useful solutions to (8) cannot be constructed, if ^ is aperiodic and > 0, 
then from the definitions it can be shown directly that for any "small" set C, 

hm i log(P^(x, C)) = A(a) = - log(r«) a.e. xeX, 

n— »oo 

where P" denotes the n-fold composition of the kernel Pa with itself. Prom this, the "pinned" 
multiplicative mean crgodic theorem (14) is easily obtained. The drawback to this approach is the 
restriction imposed by the small set C in (14). As wc will see, this restriction is not necessary when 
$ is geometrically ergodic. Nevertheless, in the case of "first-order" large deviations (as opposed 
to more precise estimates as in (13)), these methods provide what appear to be the most general 
large-deviations results to date [9, 12]. 

B. Lyapunov functions and compact sublevel sets. A well-behaved solution to the multiplica- 
tive Poisson equation (10) can be shown to exist under suitable bounds on the transition kernel P. 
For example, (8) will admit a bounded solution fa under the "uniform domination" assumption of [51, 
Sec. 6] : For some £ > and all measurable A cX: 

P{x,A)>eP{y,A), x,yeX. (15) 

Condition (15), as well as its variants in [28, 16, 30, 13], are significantly stronger than geometric 
ergodicity, and are rarely satisfied for non-compact state spaces. In particular, they imply that the 
process is Doeblin recurrent, a property that is equivalent to geometric ergodicity with a bounded 
Lyapunov function V; sec [37, Chapter 16]. 

Similar conditions are used in Donsker and Varadhan's classic papers; see [52] for a general expo- 
sition. Variations on their assumptions are used throughout the large deviations literature (including 
the recent work by Wu - see [56] and the references therein), and they all imply the validity of a con- 
dition stronger than geometric ergodicity, the multiplicaUve regularity condition (mV3), stated and 
discussed in Section 2.2. In particular, Varadhan in [53] assumes directly that (mV3) holds. 

C. Spectral gap. In all of the aforementioned works, only the case where a G R is considered. 

Specifically, the positivity of the semigroup generated by the kernel Pa in (9) is exploited in construct- 
ing solutions (Xa, fa) to the eigenvalue problem (10). Nagacv in [40] treats the special case of ergodic 
Markov chains that converge to the stationary distribution at a uniform geometric rate, 

\P\x, A) - 7t{A)\ < Boe-''°\ for all x = $(0), all measurable A c X. (16) 

This condition is equivalent to Doeblin recurrence. A version of the multiplicative mean ergodic 
theorem is proved, under (16), for purely imaginary a = icj in a neighborhood of zero. The gist of 
this approach is to formulate the problem in a vector-space setting similar to that considered here. 
Noting that the transition semigroup {-P"} of the Markov chain converges in operator norm to the 
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invariant probability measure (as n — > oo, where P"^ is viewed as a linear operator from — > Loo), 
the continuity of the norm is exploited to obtain convergence of the semigroup {P^}- 

Operator-theoretic approaches have been extensively used in the classical theory of Markov chains, 
and the assumption of uniform geometric ergodicity (16) is traditionally used to ensure a spectral gap, 
and hence convergence, as in [40]. Generalizations have typically involved an alternative vector-space 
setting, such as an Lp space for p < oo; see [54, 29, 4] and also [22, 21]. In particular, under 
the assumption of hypercontractivity, Dcuschcl and Stoock [14] derive large deviations properties for 
Markov chains. Note that, as hypercontractivity implies L2-ergodicity at an exponential rate, it also 
implies that (V4) holds [37]. 

In a different vain, in [37, 38] the weightcd-L^ space is considered, 

Ll:={g:X^C : sup[\9{x)\/V{x)] < 

X 

with X — > [1, oo) being the Lyapunov function in condition (V4). The convergence of the semigroup 
{P*^} in the induced operator norm on this space is equivalent to geometric ergodicity [37, 38], and 
based on this equivalence we show in this paper that (V4) leads to multiplicative mean ergodic theorems 
of the type (6)-(8) for complex a, and also to criteria for the existence of solutions to (10) under 
conditions far weaker than those used in, for example, [47, 52]. We also substantially strengthen the 
conclusions of both [47] and [42, 43] since we can apply the F-uniform ergodic theorem of [37] to 
obtain uniform geometric convergence in (7). 

In earlier work related to the ergodic theory of Markov processes (as opposed to the multiplicative 
ergodicity and large deviations issues considered here), Kartashov considered weighted norms in [31, 
32] , and a version of the F-uniform ergodic theorem for countable state space chains first appeared in 
[25]. 

D. Nonlinear semigroups. For a continuous-time Markov process $ (typically a diffusion), Flem- 
ing [20] and Feng [18] consider a nonlinear operator H defined as a modification of the generator A of 
the process 

n{G) := log{{g-')Ag) , where g = e^. 

For any function F G L^, the multiplicative Poisson equation is given in continuous time as 
Af = exp(— F + A)/, where A = A(l) [recall the definition of A(-) in (6)]. If y = / is a solution for a 
given F, then 

n{G) = log[i9-')e''-^9] = -F + A. 

Define the functional Q on as G{F) = log(c/), where / solves the multiplicative Poisson equation 
and c = 7r(/)^^ is a normalizing constant. The operator Q is an inverse of —7i in the sense that 
HoQ = —I on some appropriately defined domain. 

Under (V4), the results of the present paper imply that ^ is a bounded nonlinear operator, whose 
domain contains an open ball in Lqo centered at the origin. In particular, our results provide methods 
for verifying the structural assumptions of [18, 19]. A thorough investigation of this nonlinear struc- 
tural theory and its intimate relationship to large deviations properties is carried out in the subsequent 
work [33], for Markov processes satisfying the stronger assumption of multiplicative regularity. 
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Organization. The rest of the paper is organized as follows. In Section 2 we collect the basic 
notation and definitions that will remain in effect throughout the paper. We present background 
results from the ergodic theory of Markov chains and processes, and briefly discuss several different 
conditions for ergodicity and the relationships between them. 

In Section 3 we collect some results about the convergence parameter of a positive semigroup. 
Section 4 develops the spectral theory and multiplicative ergodic theory along the lines discussed 
above. Analogs of (6)-(8) arc proved for geometrically ergodic Markov processes. 

Sections 5 and 6 contain the probabilistic results outlined in Section 1.2. Finally in Section 7 
we give numerous examples of Markov chains and processes satisfying the assumption of geometric 
ergodicity. 



2 Ergodicity 

In this and the following section we review some necessary background results from certain parts of 
the ergodic theory of Markov chains and processes [37] , and some results regarding the convergence 
parameter of a positive semigroup as defined in [45] . All of this concerns a li'-irrcducible and aperiodic 
chain or process $ on a general state space X (see below for precise definitions). We assume that X is 
equipped with a sigma-field B, and that B is countably generated. The distribution of $ is described 
by a transition semigroup {P* : t G T}, where T is taken to be either the nonnegative integers 
(in discrete-time) or the nonnegative reals M+ (in continuous-time), and where for each t, P* is the 
transition kernel 

P*(x, A) := Pr{$(t) G A \ $(0) = x}, xeX,AeB. 
Recall that P* acts on functions / : X — > M and signed measures v on B, via 



PV(-)= / P\;dy)fiy) and i^P\-) = [ u{dx)P\x,-), 



(17) 



respectively. 



2.1 ^/^-Irreducibility 

For any ^ > 0, we defined the resolvent kernel Rq by, 

/ oo 







5^(1 - e-^)e-'^"P" discrete-time 
9e~^^P* dt continuous-time. 



Re-= { 



L J[0,oo) 

and we write R for Pi . 

If for some cr-finite measure ip on B, some ^ > 0, and all functions s : X 
J s{x)ijj{dx) > 0, we have 



R0{x,s):= / R0{x,dy)s{y) > 0, x G X, 
Jx 



(18) 



[0, oo) with ijj{s) 



then the semigroup {P* : t G T} is called -irreducible, and is called an irreducibility measure. If the 
transition semigroup {P*} associated with the Markov process * is ■0-irreducible, then we say that * 
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is ■i/'-irreducible. The set of functions s: X ^ M_|_ with ip{s) = J s{x) ip{dx) > is denoted by 13~^, and 
all such s are called -i/i-positive. 

Throughout the paper, we will assume that $ is 'i/'-irreducible. Moreover, without loss of generality 
we assume that ip is maximal in the sense that any other irreducibility measure ip' is absolutely 
continuous with respect to tp [37]. We will also assume that the semigroup {P* : i G T} is aperiodic, 
that is, for any s G B'^ and any initial condition x, 

P^{x, s) > for all t sufficiently large. 

If the semigroup associated with the Markov process $ is aperiodic, then we say that $ is aperiodic. 

A measurable subset C of X is called full if ip{C^) = 0, and it is called absorbing if Re{x, C^) = 
for x G C (for some 9). We recall that, for a i/'-irreducible a non-empty absorbing set is always full 
[37, Proposition 4.2.3]. 

A function s G 23+ and a measure v on B are called small if, for some ^ > 0, 

Re{x,A)>s{x)v{A), xeX,AeB. (19) 

In [37, Proposition 5.5.5] it is shown that for a ■i/'-irrcducible one can always find a 9 and a 
pair {s,u) satisfying the bound (19), such that s{x) > for all x, and with ly equivalent to the 
maximal irreducibility measure i/j (in the sense that they are mutually absolutely continuous). A 
similar construction works in continuous-time as well. 

If a small function s is of the form s = eIq for some £ > and C G B, then the set C is called 
small. We denote by Bp the set of all small functions s G B^, and we denote by Mp the set of all 
(positive) small measures v which satisfy (19) for some s G B^. Both Ai^ and Bp are positive cones, 
and they are closed under addition. 



2.2 Ergodicity Conditions 

Let V : X ^ [0, oo] be an extended-real valued function, with V{xo) < oo for at least one xq G X. Let 
Sv denote the (nonempty) set: 

Sv = {x : V{x) < oo}. (20) 

In most of the results below our assumptions will guarantee that Sy is absorbing, hence full, so that 
V{x) < oo a.e. [ip]. 

Let denote the vector space of measurable functions /i : X — C satisfying 

llr II 1^(2^)1 

\\h\\v:=sup'-^ <oo. 

Similarly, l4> will denote the corresponding space for an arbitrary nonnegative (measurable) function 
/ on X. We define the V-novm |||-P|||y of an arbitrary kernel P = P{x, dy) by 

My:=sup^^, (21) 

where the supremum is over all h G with 7^ 0. 

In what follows, it will be convenient to describe some important properties of $ in terms of its 
generator A rather than in terms of its transition semigroup {-P*}. For a function </ : X ^ C, we write 
Ag = h li for each initial condition $(0) = x G X the process {m{t) : t G T} defined by 

m{t):= [ hms))ds-g{^{t)), t G T, (22) 

J[o,t) 
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is a local martingale with respect to the natural filtration {J^t = a{^{s), < s < t) : t & T}. In 
discrete-time the generator is simply A = P — I. 

Next we introduce two different regularity conditions on ^, taken from [37]. As we will see, the 
first one guarantees the validity of ergodic results as in equations (3)-(5), whereas the second one will 
be used to prove their multiplicative counterparts (6)-(8); see Section 4. 

Throughout this paper we assume that the function V is finite for at least one x G X. 



For a function / : X — > [l,oo), a probability measure 
v on B, sl constant b < oo, a function s : X — ^ (0, 1], 



AV < -f + bs 



and a F : X ^ (0,ool: ^ (V3) 

For a probability measure v on B, some constants 

6 < oo and J > 0, a function s : X (0, 11, and a .^r ^ 

^ ' ^ R > s u. ^ ^ 

Note that condition (V4) is stronger than (V3): When (V4) holds, (V3) also holds with f = V, 
V' = and b' = b/5. The assumption that (V4) holds for a Markov chain * is the main condition 
required for most of our results, and it will remain in effect essentially for the rest of the paper. To 
formalize this assumption we introduce the following definition: 

Geometric Ergodicity. A Markov process * is called geometrically ergodic (with Lya- 
punov function V), if it is -^-irreducible, aperiodic, and it satisfies condition (V4) (with 
this V). 

In Section 7, numerous examples are given for which the validity of (V4) is explicitly verified; see 
also [37, Chapter 16]. For comparison, we also introduce the following related condition for continuous 
time Markov processes, which we think of as the natural multiplicative analog of (V3): 

For a function / : X ^ [1, oo), a probability measure 

u on B, constants 5 > and b < oo, a function s : , f a v\ ^ 
X-. (0,1], anday :X^ [l,oo]: ""^V "^^J - '^J^^^ 

R > s (g) z/. 

(mV3) 

As discussed in the introduction, condition (mV3) is very closely related to the conditions in the 
well-known Donsker-Varadhan large deviations results. In particular, under the conditions of [52], 
especially Assumption (3) in [52, p. 34], it follows from Theorem 3.3 below that (mV3) is satisfied. 
Moreover, in the general case where the state space X is not compact, Varadhan's conditions imply 
that (mV3) holds with an unbounded / with compact sublevel sets, an assumption already stronger 
than (V4) as the following Proposition shows. A detailed study of Markov processes satisfying (mV3) 
is given in [33], where the analog of (mV3) for discrete-time Markov processes is also given. In the 
context of diffusions, more specific results can be found in [27]. Proposition 2.1 is proved in [33]. 

Proposition 2.1 Suppose ^ is ip -irreducible and aperiodic. If (mV3) holds, then so does (V4)- 



9 



2.3 Ergodic Theorems 

Under either (V3) or (V4), there exists a unique invariant probabihty measure n on B (see below). 
Given such a tt, we define 11 as the kernel 

n = l(g)7r, 

so that Il{x, A) = Tr{A), x eX, A e B. If Tr{V) := Tr{dx)V{x) < oo, then 11 acts on as a bounded 
linear operator. 

A fundamental kernel is a linear operator Z: lIo (for some measurable functions / > 1, 

V > 1), satisfying 

AZ = -{I-U). (23) 
That is, for any F G lIo, the function F = ZF G solves the Poisson equation, 

AF = -F + Tr{F), (24) 

where 7r(F) = J.^Tr{dx)F{x). Equivalently, the stochastic process 

m{t) = F{^t)) - F($(0)) + / iF{^{r)) - 7r(F)) dr, t > 0, (25) 

J[o,t) 

is a martingale with respect to {J^t}- 

The following two theorems give equivalent conditions for ^ to be ergodic or geometrically ergodic, 
respectively. Corollary 2.3 states that a fundamental kernel exists, and the ergodic results (3)-(5) given 
in the introduction indeed hold as soon as $ satisfies (V3). 

For any C & B, let tc denote the hitting time 

Tc := inf{t > 1 : ^{t) € C}. 

Theorem 2.2 (Ergodicity) Suppose that * is -irreducible and aperiodic. For any function / : X — ^ 
[1, oo) the following are equivalent: 

(i) The process * is positive recurrent with invariant probability measure tt, and 7r(/) < oo. 

(ii) There exists a small set C such that 



sup 
x&C 



fm))dt 

[0,TC) 



< OO . 



(iii) Condition (V3) holds with the same f. 

If any of these conditions holds, then the set Sy defined in (20) is absorbing and full, and 

sup |P*(x,5) - 7r(5)| -> 0, i^oo, xeSv- (26) 
9-\9\<f 

Moreover, for any small measure v there exists a fundamental kernel Z which is a bounded linear 
operator, 

Z:Li^{heLl : u{h)=0}. 
If Z' is any other such fundamental kernel, then \\ZF — Z'F\\v = 0, F E lIq. 
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Proof. The discrete-time version of (i)-(iii) is a consequence of the /-Norm Ergodic Theorem 
of [37], and the continuous-time version follows from [35, Theorem 5.3]. The construction of the 
fundamental kernel and the uniform bound is given in [23, Theorem 2.3]. □ 

Recall the definition of St in (1). 

Corollary 2.3 (Ergodic Theorems) Let ^ be a tp -irreducible, aperiodic Markov process that satisfies 
(V3). IfFeL^ with f as in (V3), then for xeSy: 

(a) E^[|5i] ^7r(F) as t ^ oo. 

(b) There exists F G with Tr{F) = 0, so that F solves the Poisson equation 

AF = -F + Tr{F). 

(c) //, in addition, 7r(F) < oo, then F satisfies 

F{x) = lim E^lSt-tiriF)]. (27) 

t— >oo 

Proof. The convergence in norm (26) implies the convergence, 

E^[F($(t))] ^7r(F), t^oo, xeSv, 

which gives (a). For (b) we can take F = ZF, where Z is given in Theorem 2.2. 
When Tr{V) < oo it follows that, for some 6i < oo, 



/ 



E^[F($(t))] - 7r(F) dt < bi\\F\\fV{x) , x e Sy ■ 



This is given as Theorem 14.0.1 of [37] in discrete-time. The continuous-time case follows on consid- 
ering the skeleton chain ^{6k), k = 1,2, 3, . . ., as discussed on p. 247 of [36]. This implies that one 
version of the fundamental kernel may be expressed as 

Z{x, A)= [ {P\x, A) - Tr{A)) dt , xeSy, AeB, 
Jt 

and Z is a bounded linear operator from l[o to L^. This gives (c). □ 

The solution F of the Poisson equation given in Corollary 2.3 (b) arises in almost every limit 
theorem considered below. In particular, it can be used to define the asymptotic variance in the 
central limit theorem; see Theorem 17.4.5 of [37]. In the discrete-time case, cr^ < oo as soon as 
7r(F^) < oo, and equation (17.44) of [37] gives the representation. 



= lim -Var^{5„} = F($(n))^ - (PF($(n)))^ 

n— >oo n 



X G Sy. 



Next we obtain a characterization of the case when = 0. In discrete time, a similar result is 
derived in [3] using different methods. 
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Proposition 2.4 (Variance Characterization) Suppose that $ satisfies (V3) with vr(y^) < oo. Then, 
for any F G Loo, the asymptotic variance 

:= lim jVar^^t} (28) 

t—^oo t 

exists for any initial condition $(0) = x G Sy- Writing F = F — 7r{F), cr^ satisfies, for all t > 0, 

2" 



F($(t)) -F($(0)) + / F{^{s))ds 



< oo . (29) 



Moreover: 

(i) If = then there exists G G L^, satisfying, 

I F($(s))ds = G($(i))-G($(0)), a.s. [tt]. (30) 
J[o,t) 

When time is discrete, this can also he expressed as P{x, Sx) = 1, x E Sy, where -S^ = {y G X : 
G{y) = G{x)-F{x)}. 

(ii) Conversely, if (30) holds for some G G L^, then = 0. 

Proof. Result (i) is an immediate consequence of (29), which follows from the martingale charac- 
terization of F (see (25)). Result (ii) is immediate from (28). □ 

Theorem 2.5 (Geometric Ergodicity) Suppose that ^ is ip -irreducible and aperiodic. The following 
are equivalent: 

(i) There exists a probability measure tt and a V.X — ^ [l,oo], such that P* converges to tt in the 

V-norm, 

P* ^ 1 (g) vr, t^ oo. 

(ii) There exists a small set C and e > such that 



sup 



exp ere 



< oo. 



(iii) Condition (V4) holds for some FrX — >■ [l,oo]. 

// any of these conditions holds, then the set Sy defined in (20) is absorbing and full for any function 
V satisfying (iii), and there exist constants bo > 0, Bq, B'q < oo and an invariant probability measure 
TT on B, such that 

IIP* - 1 7r|||y < Soe-^o*, iGT, 
\^x[St-tT^{F)]-F{x)\ < B'o\\F\\ve-^°' , F e ,x e Sy ,t eT . 

Proof. The discrete-time result is [37, Theorem 15.0.1], and the continuous-time version is the 
main result of [15]. □ 
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3 The Generalized Principal Eigenvalue 



As in the previous section, wc assume that $ is a geometrically crgodic Markov process. We fix 
throughout this section a bounded (measurable) function F : X — > R and a real number a G M, and 
we define the semigroup {P^ : t G T} by 

Piix, A) = E,[exp(a5t)I^($(i))], t G T , (31) 

where St is defined as before by (1). In this section we consider the general properties of this positive 
semigroup; we therefore suppress the dependency on a and F and simply write {-P*} for {P^ : t G T}. 

We say that an arbitrary positive kernel P is probabilistic if P{x,X) = 1 for all x EX. Similarly, a 
semigroup {P*} is called probabilistic if P* is probabilistic for all t. Clearly, the semigroup {-P*} = {Pa} 
is, in general, non-probabilistic. The definitions of ^/>-irreducibility and aperiodicity carry over to non- 
probabilistic semigroups immediately. [Further extensions to kernels defined for complex numbers 
a G C will be treated in Section 4.] Note that {-P*} is irreducible and aperiodic as soon as {P*} is. 
We also define a family of resolvent kernels Rg for 6 > exactly as in (18), with P* in place of P*. 
To ensure that these arc finite for all a; G X and a suitable class of A G fi, we usually consider ^'s in 
the range 6 > |q;|||P||oo- (And as before, we write P = Pi.) With Rq replacing Rq, the definitions of 
small functions, measures, and sets carry over verbatim. 

Finally, we define the generator A of the semigroup {P* : t G T}: We write Ag = /i if 

ptg (x) = g{x) + / P'h (x) ds, teT, xe X. (32) 
J[o,t) 

The following resolvent equations will play a central role in a lot of what follows: 

ARe = {e^ — l){Ro — I) discrete-time 

ARe = 6 {Re — I) continuous-time 

In continuous-time, the resolvent equation can be used to establish the following identity, whenever 
the sum and integral converge absolutely, 

oo „ 

n=l •^[O'"^) 

This is tremendously valuable in consolidating continuous- and discrete-time theory. 

Since the semigroup {P*} is -(/'-irreducible, the kernel Rq satisfies the following minorization con- 
dition: There are s G and u G M.'^ such that 

Re > s ® V. 

[Note that, since the semigroup {P*} is derived from {P*}, the above domination condition is satisfied 
with s and v that are small with respect to {P*}.] Let {k„ : n > 1} denote the positive sequence 
defined by: 

nn = v{ReT-'^s, n>l. 

This sequence is supermultiplicative, 

> i^iReT-Hs <S> iy){Rer-h 
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so there exists some L{6) G (— cx), oo] such that 

i log{iyRp) = i log(K„+i) ^ L{e), n^oo. 

The constant 

re := cxp(-L(0)) (35) 
is caUed the convergence parameter for the kernel Rg [45]. It satisfies: 



n=0 



= OO, for all X eX, if r > ro 

< oo, for a.e. x G X [■^], if r < rg. 



To move from the resolvent back to the original semigroup, we apply the resolvent equations (33). 
These relations establish the major part of the following theorem. 

Theorem 3.1 (Generalized Principal Eigenvalue) Suppose * is ip -irreducible and aperiodic. Then 
there is a Xo E (0, oo] such that, for any s G : 



(i) 



/ X-^P^s{x)dt { 

JT 



OO for all X eX, A < Ao 

(36) 

< oo for a.e. X G X A > Ao- 



(ii) ^ log(P*s(x)) Ao := log(Ao) a.e. xeX t ^ oo . 

Proof. Result (i) follows from Theorem 3.2 of [45] for discrete-time chains, and from (34) for the 
continuous-time case where we may translate to the discrete-time case using the resolvent Rg. Then 
with Xg = r^^, 

Ao = -I- (1 — e^) Art discrete-time 

_i (37) 
Ao = exp(0(l — Xg )) continuous-time. 

The second part follows from an argument similar to that used in the proof of Lemma 3.2 of [2]. □ 

We call the constant Ao the generalized principal eigenvalue (g.p.e.) of the semigroup {P* : t G T}. 
This generalizes the corresponding definition of [47], and, as we will see in Theorem 3.3 below, Ao does 
indeed play the role of an eigenvalue. The interpretation of (ii) is the "pinned" multiplicative mean 
ergodic theorem (14) discussed in the introduction, 

i log E^[exp(a5t)Ic($(t))] ^ A(a) , t ^ oo , (38) 

for a.e. x G X [tp]. This follows from taking s = elc in (ii) with C small, and £ > 0. 

Theorem 3.1 leaves open what happens in (36) when A = Ao. The semigroup {-P*} is called: 

(i) Xo-transient if 

X~^vP*sdt < oo 



J 



(ii) Xo-recurrent if 

X~*uPhdt = oo 



JT 
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(iii) Xo-geometrically recurrent if the function 

(Ao -z) [ {vPh)z'*dt 
Jt 

is analytic in a neighborhood of z = Ao- 

In (i)-(iii), {s, u) is any pair with s G i3+ and v G M.p- The particular small function or small measure 
chosen is not important [45]. 

The construction of h in part (ii) of the following Lemma is an extension of the minimal harmonic 
function in [45, Proposition 3.13], where here we allow the semigroup {-P*} to possibly be transient. 

Lemma 3.2 (i) Suppose that {P* : t G Z+} has g.p.e. Ao < oo, and suppose that the following 
minorization condition holds for some s G and v G Mp : 

P>s®v. 

Then, 

oo 

^ A-*-V(P - s (g) i/)*s < 1, 

t=0 

with equality if and only if the semigroup is Ao -recurrent. 

(ii) // in (i) we take {P* : t G Z+} to he the probabilistic semigroup {P* : t G Z+j, then, 

h{x):=^^{P-s0u)''s{x) < 1; 

J2'^{P - s^iy)''Ps{x) = -s{x) + {l + iy{s))h{x) <2, for all X ex. 

In this case, if {P^ : t G Z+} is 1-recurrent, h{x) = 1 for a.e. a; G X [ip]. 
Proof. Part (i) is Proposition 5.2 of [45]. The essence of this result is the inversion formula, 



Iz 



Iz- (P- s(^u)\ + -^—s^f) (39) 
J V 1 — «; / 



where 

K, = v[Iz - {P - s® v)\~^s . 
Prom (39) it may be seen that, for z > 0, 

K = 1 if and only if u[Iz — P]~^s = oo. (40) 

The proof of (ii) is by induction. Define, for n > 0, 

n 
t=0 

For n = 0, ho = s and s < 1 by assumption. If true for n, then 

hn+i (x) = (P - s (gi iy)hn (x) + s{x) 
< {P - s (S> 1^)1 (x) + s{x) 
= [P{x,X) - s{x)u{X)] + s{x) 

= 1, 
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where in the last equation we have again used the fact that s < 1. It fohows that h{x) = hm hn{x) < 1 
for all X. 

To see the second bound, write 

(P - s ® z/)"Ps = (P - s (g) i/)"+^s + (P - s (8) (g) u]s. 

Summing over n gives the desired result. □ 

The reason we call the constant Ao a generalized eigenvalue is clarified by the next theorem, where 
it shown that, if the semigroup {P* : t G T} is Ao-recurrent, then there is a function / : X — >■ [0, oo) 
so that (/, Ao) solve the eigenvalue problem, 

Pf = Xof. (41) 

Equation (41) is an instance of the multiplicative Poisson equation. Conditions for the existence of 
a solution to (41) based upon Lemma 3.2 (i) are well-known in the discrete-time case. A candidate 
solution is given by 

oo 

f = Y,r'^Re{Re-s0urs, (42) 

n=Q 

where ({?,s,i/) satisfy 6* > Ao, s G B^, u G M.p, Re > s ® v, and rg is the convergence parameter 
defined in (35). 

Theorem 3.3 (Existence of an Eigenfunction /) Suppose that $ is i}) -irreducible, and that the g.p.e. 
Ao of the positive semigroup {P*} is finite. Then the function f given in (42) is finite a.e. [ip], and 

(i) If {P* : t &T} is Xo-recurrent then f solves the multiplicative Poisson equation: 

P'f = Xif, teT. (43) 

(ii) If {P^ : t gT} is Xo-transient then for any small function s G Bp , there exists (5 > such that 

Pf = Xof-ds. (44) 
Hence, in the Xo-transient case there is a solution f to the pointwise inequality 

Pf < Ao7 (45) 
with f finite a.e. and where the inequality is strict whenever f{x) < oo. 

(iii) The solution (42) is minimal and essentially unique: If f:X ^ (0, oo) is any solution to the 
inequality (45), then there exists c G M+ such that f{x) > cf{x) for all x. 

If {P^ : t eT} is Xo-recurrent, then we have f = cf a.e. [tp], and f{x) > cf{x) for all x. 

Proof. These results are all based on Theorem 5.1 of [45] in the discrete-time case. 

If > is taken large enough, then the resolvent Rg satisfies vRqs < oo for any small s, v satisfying 
the domination condition Rq > s®v. We then set 



fe = Y.^''eiRe-s®yrs, 



n=0 
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where rg is the convergence parameter for Rg. We have 

roiRe - s (8) iy)f0 = fe- s, 

and hence 

/ := Refe = rg^fd - Ses, 

where 60 = Vg^ — v^fe) > 0. This constant is strictly positive if and only if the semigroup {R!q} is 
r^^-transient (see Lemma 3.2 (i)). 

Results (i)-(iii) then follow from the resolvent equation in discrete or continuous-time. □ 

4 Spectral Gap and Multiplicative Mean Ergodic Theorems 

The following assumptions will be held throughout the remainder of this paper: 



(i) The Markov process $ is geometrically ergodic with a 

Lyapunov function F : X — > [l,oo), such that 7r(F^) < 00. 

(ii) The (measurable) function F:X ^ [—1, 1] has zero mean 

(46) 

7r(F) = 0, and non-trivial asymptotic variance 
:=limtYaT^{St/Vi}>0. 



Note that the additional assumption 7r(y^) < 00 can be made without any loss of generality: (V4) 
implies (V3) with f = V as discussed above, which implies that 7r(/) < 00 [37, Theorem 14.0.1]. 
Moreover, Lemma 15.2.9 of [37] says that (V4) also holds with respect to ^/V (and some, possibly 
different, small function s), so we can always take V in (V4) such that 7r(F^) < 00. 
Until Section 4.2 we specialize to the discrete-time case for the sake of clarity. 

With V as in (46), the spectrum S{P) C C of a bounded linear operator P'-L'^ is defined 

to be the set of nonzero A G C for which the inverse (/A — P)~^ does not exist as a bounded linear 
operator on L^. 

Recall that an arbitrary kernel P acts on functions (on the right) and on signed measures (on the 
left) as in (17). With that in mind, we think of a kernel P as an operator acting on a appropriate 
function space. The kernel P is a bounded linear operator on provided its V-norm jjjPjjjy is finite, 
since this is precisely the induced operator norm. For an arbitrary linear operator P: — >■ we 
continue to define the norm |||-P|||y as in (21). Also we recall that P acts on a suitable space of measures 
(on the left) as 

i^P{A):=iy{PlA), AeB. 
For a eC the kernel Pa defined in (31) yields an operator Pa'. — > acting via 

Pag (x) = exp{aF{x))Pg (x), xeX, geL^. (47) 

Its spectrum is denoted Sa = S{Pa)- The n-fold composition of the kernel Pa with itself acts on 
as 

P^g (x) = E^[exp(a5„)5($(n))], x G X, 5 G L^, n > 1, 
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where {Sn : n > 1} denote the partial sums (2). Letting JW\ denote the space of signed and possibly 
complex-valued measures /j, satisfying < oo, we obtain analogously, 

HP^ (A) = J E^[exp(aS„)I($(n) G A)] n{dx), AeB, fiE MY , n > 1 . 

In this section we identify a region $7 C C such that, for geometrically Markov chains, eigenfunctions 
fa G and (positive) eigenmcasures jla £ -M-Y exist for P^, corresponding to a given eigenvalue 
Aq G Sa and a E ft. Suppose that such fa, p,a are found, and assume that they are normalized so that 

/ia(/a)=Aa(X) = l. (48) 

We then let Qa- —>■ denote the operator Qa = fa'^ f^a, 

Qag{x) = fta{9)fa{x), ^GL^, XGX. 

Note that Qa is a projection operator, that is, = Qa- 

The main results of this section are summarized in the following two theorems. In particular, 
the multiplicative mean ergodic theorem given in (50) will play a central role in the proofs of all the 
subsequent probabilistic limit theorems. 

Theorem 4.1 (Multiplicative Mean Ergodic Theorem) Assume that the Markov chain * and the 
functional F satisfy (46)- With 5 and b as in (V4), define: 

Then there exists aJ > such that, for any a in the compact set 

J7 = {a = a + G C : |a| < a, and |a;| < a;}, 

there is an eigenvalue Xa G Sa which is maximal and isolated, i.e., 

\Xa\ = max{|A| : A G Sa} and 5^ n {z : \z\ > \\a\ - So} = {Xa} 

for some 5o > 0. 

Moreover, for any such a, there exist fa G -L^ and fia £ MX > satisfying (48), and: 

(i) The functions fa solve the multiplicative Poisson equation 

Pafa ~ ^afa-i 

and the jla o-f^ eigenmcasures for the kernels Pa •' 

(ii) There exist constants bo > 0, Bo < oo, such that for all a E il, x eX, n> 1, 

E,[exp(Q5„ - nA(Q))] - /,(x)| < Bo\a\V{x)e-''"'' , (50) 

where A(a) := log(AQ) is analytic on ft, and Sn are the partial sums defined in (2). More 
generally, for any g G L^, 

E^[exp(aS„ - nk{a))g{^{n))]-Qag (x) < Bo\\g\\vV {x)e-'^'' . 
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Proof. The existence of an isolated, maximal eigenvalue Aq, is given in Proposition 4.12. It is 
nonzero for a = a G [—a, a] by Proposition 4.3, and since it is analytic in a (by Proposition 4.12), we 
can pick a; > small enough such that Aa is nonzero on ^2. 

The existence of an eigenfunction and eigenmeasure as in (i) follows from Proposition 4.12 combined 
with Proposition 4.8. To see that a > note that, under (V4), 

7r(y) = 7r(PF) < (l-(5)7r(F) + 67r(s). (51) 

Hence, h > 6TT{V)/ir{s) > S. 

To prove the limit theorems in (ii), consider the linear operator 

From Proposition 4.8 wc can find eq > such that C^(2,a) is an analytic function of two variables 

{z, a) = (z,a + iio) on the domain 

V = {\z\ > 1 — £0) |a| < a + £0) l'*^! < w + £o}. 

We may also assume that eo > is suitably small so that, for some 6 < oo, we have |||^/(2,o!)|||y < h for 
all (z, a) £V. 

Set 6o = — log(l — eo) > 0. The following bound then holds for all (z, a) &T>, g & L^, x G X, and 
n > 1, by representing U(^z,a) as a power-series, and using the fact that Qa is a projection operator: 

bV{x) > |/o"e'^"+'^'^t^(exp(-6o+m0),a)5(^)# 

= 6(^^+1)^0 1 (A-ip,-Q„)"(7(x) I 
= e(^+^>o\\-^P^gix)-Q^gix)\. 

This gives the second bound in (ii). The first one follows from the second since, when a = and 

\X-^PSg{x)-Qagix)\ = 0, 

for all ra > 1, X G X. □ 

Next we give a weaker multiplicative mean ergodic theorem for all a = a + iu; in a neighborhood 
of the zw-axis. A function F : X — M is called lattice if there are h > and < d < h, such that 

Fix) -d . . 

IS an integer, x G X . (52) 

The minimal h for which (52) holds is called the span of F. If the function F can be written as a sum, 

F = Fo + Fe, 

where F^ is lattice with span h and Fq has zero asymptotic variance (recall (28)), then F is called 
almost-lattice (and h is its span). Otherwise, F is called strongly non-lattice. 

Although these definitions are somewhat different from the ones commonly used when studying 
the partial sums of independent random variables, in the Markov case they lead to the natural analog 
of the classical lattice/non-lattice dichotomy. This dichotomy, which is close in spirit to the discussion 
in [50], is stated in Theorem 4.13. 
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Theorem 4.2 (Bounds Around the iw-Axis) Assume that the Markov chain $ and the functional F 
satisfy (46)- 

(NL) Suppose that F is strongly non-lattice. For any Q < ojq < loi < oo, there exist bo > 0, Bq < oo 
(possibly different than in Theorem such that 

E^[exp{aSn - nA{a))] < SoF(x)e-^"" , x G X, n > 1, (53) 

for all a = a-\- iu with \a\ < a and loq < \uj\ < ui. 

(L) Suppose that F is almost-lattice with span h > 0. For any e > 0, there exist bo > 0, Bq < oo 
(possibly different than above and in Theorem 4-l)j such that (53) holds for all a = a + iui with 
\a\ < a and e < < 27r/h — e. 

Proof. By Theorem 4.13 we have the bound < = for the range of a G C considered in 
the theorem. This imphes that there is an £i > 0, 6i < oo such that 

\l[Iz-e-^'^P^]-%<bi 

for all l-^l > 1 — £i, and all a in this range. An argument similar to the proof of Theorem 4.1 (ii) then 
gives the desired bounds. □ 

4.1 Spectral Radius and Spectral Gap 

Recalling our standing assumption (46), we fix the Lyapunov function V : X ^ [l,oo) throughout this 
section. 

For complex a we wish to construct A{a) G C satisfying the multiplicative mean ergodic limit, 

A(a) = hm ^ log Ej:[exp(aS'„)], x eX. 

ra— »oo 

This requires a generalization of the notion of the g.p.e. of Section 3. The previous definition is 
meaningless when a M, since the definition of a small set depends on the linear ordering of M. 

Spectral radius. For a bounded linear operator P: we define the spectral radius of P by 

I := lim (|||P«|||^)Vn = exp( lim i log . (54) 

n— »oo Vn— >oo " / 

Note that in the above definition P is not assumed to be a positive operator, and it is possibly 
complex- valued. Since ||| • |||y is an operator norm, the sequence {log(|||P"'|||y) : ra > 1} is subadditive 
[48]. Therefore ^ always exists, although it may be infinite. 

We let denote the spectral radius of the operator Pa defined in (47). When a = a is real, 
from the definitions we have that ^„ > where A« is the g.p.e. of the positive kernel Pa- One of 
the main goals of this section is to show that the spectral radius ^q, coincides with Aq for real a in a 
neighborhood of a = 0. We first establish upper and lower bounds: 

Proposition 4.3 Under (46), the spectral radius ^a of Pa is finite and 

L < (6 + l)exp(|a|), 

for all a = a + iuj E C Moreover, for a = a G M, 

ia > e-" > 0. 



20 



Proof. The function s in (V4) is necessarily bounded by one. Consequently, under (V4) we have 
for any g G L^, a = a + iu E C, 

\Pa9 (x)| < eM\a\)\\9\\vPV < exp(|a|)||(/||y (1 + b)V . 

This implies that |||-Pa|||y < el"l(l + b). The operator norm ||| • \ly is submultiplicative, 

iray<|||Pa|||?^<eH"(l + 6r, n>l, 

giving the upper bound. 

When a = a is real, for any g G L^, g >0, we have, 

P^gix)>e-^^P-g{x). 

It follows immediately that > ^oe~" = e~". □ 

Spectral gap and ^-uniform operators. Recall the following classical result from [48, p. 421]: 

Theorem 4.4 (Decomposition Theorem) Let P : — > be a bounded linear operator, and suppose 
that zq G S{P) is isolated, i.e., for some Eq > 0, 

S{P) nD = {zq} where D = {z e C : \z - zo\ < Eq} . 

Then, the following bounded operator on is well-defined, 

Q = ^ [ [iz-P]-Uz, 

27rz Jqd 

and moreover: 

(i) Q: is a projection operator, that is, = Q; 

(ii) PQ = QP = zoQ; 

(iii) S{Q) = {1}, and S{P - zoQ) n = 0. 

We say that zq G S{P) is a pole of finite multiplicity if zq is an isolated point in S{P) and the 
associated projection operator Q can be expressed as a finite linear combination of some {sj} C L^, 
M C MY: 

n-l 

Q = ^ mij[si (g) Vj]. (55) 

i,j=0 

In particular, we call zq a pole of multiplicity one, if (55) holds for n = 1, and also there exists £o > 
such that 

S{P-zoQ)c{z:\z\<i-eo}, 

where ^ is the spectral radius of P. 

We say that P admits a spectral gap if there exists £o > such that S{P) C) {z : \z\ > ^ — Eq} is 
finite, and contains only poles of finite multiplicity. 

Further, wc say that P is V -uniform, if it admits a spectral gap and also there exists a unique pole 
Ao G S{P) of multiplicity one, satisfying |Ao| = ^. In that case, Ao is called the generalized principal 
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eigenvalue (g.p.e.), generalizing the previous definition. In particular, if Pa is ^-uniform for some 
a eC, then we write for its associated g.p.e. 

Much of the development of this section, is based on properties of rank-one operators of the form 
M = So 2^0 for some sq G L^, vq & M-Y ■ The associated potential operator is defined as 

1 -1 



Iz-{P-M) , zeC, (56) 



whenever the inverse exists. The potential operator is used to construct eigenfunctions and eigenmea- 
sures for a ^-uniform operator: 

Proposition 4.5 Suppose that P is V -uniform with g.p.e. Ao, and that the associated so,fo in (55) 
are chosen so that the potential operator Uz in (56) is hounded for z in a neighborhood of \z\ > |Ao|. 
Then, setting f = U\^so and fi = voUx^, we have f G L^, fi G MX > 

Pf = Ao/ , and fiP = Ao/i . 

Proof. From ^-uniformity we know that there exists Eq > such that the inverse [Iz — P] ^ exists 
and is bounded as a linear operator on L^, for all \z\ > ^ — Sq, z ^ Aq. Moreover, for such z we may 
apply the inversion formula (39) to obtain the identity, 

[/.-Pr = C/, + i^i^°M^^. (57) 

1 - I^qUzSo 

Since Ao G 5(P), and Aq S{P — sq ® '^o): it follows from this equation that lyoUx^so = 1. 
Applying [Iz — P] to (57) on the left, and sq on the right then gives. 

So = [Iz - P\UzSo + [Iz - P\ 



1 - l^oUzSo 



Multiplying both sides by (1 — i'oUzSq), and then setting z = Xo gives = [lAo — PJ^^Ao'^O) which shows 
that / is an eigenfunction. 

The proof that jl is an eigenmeasure is completely analogous, and follows by applying [Iz — P] to 
(57) on the right and uo on the left. □ 

The following proposition provides useful characterizations of F-uniformity. 

Proposition 4.6 The following are equivalent for an operator P with finite spectral radius ^. 

(i) P is V -uniform. 

(ii) There exists A G C satisfying |A| = ^, and Eq > 0, sq £ L^^, uq G M-Y, satisfying 

sup{|z-A||||7z-P|||^ : |z| >|-£o} < oo, 
and sup||||C/2|||^ : |z| > |-£o} < oo, 

where Uz is the potential operator defined in (56), with M = sq® vq. 

(iii) There exists A G C satisfying |A| = ^ and f G -L^, fi & MY , such that 

^-npn ^ f^j^^ n ^ oo, 
where the convergence is in the V-norm. 
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Proof. If (i) holds then the matrix-inversion formula (39) gives 

[IZ-P + Xof ® A]"' = [Iz - P]-' - , ^\ J ® A. 

The left hand side is bounded for \z\ > ^ — £o under (i). Hence we may set f = so and jl = vq to 
obtain (ii). 

The implication (ii) (i) also follows from the matrix inversion formula (39) since (57) then holds 
for all l^l > ^ — £o, z ^ X. This implies the limit 

r ( \\\T Si-i (^A^o) ® ji'oUx) ..gx 
Q := hm {z - A) [Iz - P] = ^ , (58) 

and (i) holds with this Q, and Ao = A. 

The equivalence of (i) and (iii) follows exactly as in Theorem 4.1 (ii). □ 

For a probabilistic kernel P, the following proposition says that F-uniformity implies that the chain 
with transition kernel P is geometrically ergodic. The converse is also true; see Proposition 4.10. 

Corollary 4.7 If P is a V -uniform, probabilistic kernel, then the Markov chain with transition kernel 
P is geometrically ergodic. 

Proof. Since P is probabilistic, applying the limit result of Proposition 4.6 (iii) to the constant 
function 1, implies that A = 1 and that / is constant. By rescaling we can take / = 1 and /x to be a 
probability measure. Prom Theorem 2.5 it the follows that P is geometrically ergodic. □ 

Proposition 4.5 applied to the family of kernels {Pa} gives the following: 

Proposition 4.8 Suppose that P^p is V-uniform for a given ao £ C. Then there exists eo > such 
that Pa is V-uniform (with associated g.p.e. Xa) for all a G C, |a — q;o| < £o- Moreover, for each such 
a there exist fa € and /Xq. G A^j^ such that: 

(i) fa solves the multiplicative Poisson equation, Pa fa = ^afa- 

(ii) jla is an eigenmeasure for Pa, a- 

(iii) The g.p.e. Xa is an analytic function of a, and so is fa{x) for any fixed x G X. 

Proof. The existence of eigenvectors in (i) and (ii) is immediate from Proposition 4.5 when a = ao- 
Define Uz = Uz,a by (56) with P = Pa, and M = so'Sii^o' 

-1 



Uz.a . — 



Iz - {Pa -so0 Uo) . (59) 



Prom y-uniformity wc know that M can be chosen so that Uz^ao is a bounded linear operator for z 
in a neighborhood of Aa,,- Since Pa is continuous in T^-norm, it then follows that Uz^a is a bounded 
linear operator for {z, a) in a neighborhood O of (Aqq, ao). This combined with Proposition 4.5 proves 
(i) and (ii). 

Write ^{zja) = i'o{Uz,aSo), z e C,a e O, so that 



J{Xa,a) = 1, 



dz 



^J{z,a) = MUla>o = i^a{fa)^0, aeO, 



Z—Xa 
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where fa-,i^a are the eigenfunction and eigenmeasure given in Proposition 4.5. We conclude that \a 
is an analytic function by the implicit function theorem. 

The proof that fa{x) is analytic in a for a; G = X follows from the expansion 

oo 

fa = Ux„,aS0 = J2 K''~\Pa - Sq ^ Uo^So. (60) 

n=0 

This expression for fa converges uniformly for a € O, and for each n the finite sum is analytic, which 
completes the proof of (iii). □ 

The eigenfunction (60) will not in general satisfy the required normalization (48). The following 
eigenfunction and eigenmeasure do satisfy this condition, and are the unique such solutions, 

= G fa= eLl. (61) 

Given such /„ and Aq for some real a, we define the twisted kernel Pa by 

Pa{x, dy) = Xa^fa^{x)Pa{x, dy)fa{y) , 

(cf. (11) in the introduction), and we let Va = V/fa- (As we will see below, fa is bounded away from 
zero for real a in the range of interest.) The following proposition describes the relationship between 
the transition kernels, the eigenfunctions, and the eigenmeasures {Pa, fa, jj-a '■ a 

Proposition 4.9 Suppose that Pag is V -uniform for a given real oq. Then there is an open set O C M 
containing ao, such that, for all a & O, with fa,fia given in (61) and with Pa equal to the associated 
twisted kernel, we have: 

(i) The operator Pa is Va-uniform. 

(ii) -^A(a) = log(Aa) = iTaiF), where Wa is the invariant probability measure for Pa- 
da da 

(iii) Fa := log(/a) is a solution to the Poisson equation, 

PaFa = Fa-F + 7ra{F). (62) 

For a = 0, this is the unique solution satisfying Tr{F) = 0. 
(v) F. € L^+'-rtf.). 

Proof. The existence of O follows from Proposition 4.8, and from its proof we know that |||C^Aa,a||li/ < 
oo when a E O, where Uz,a is given in (59). 

The linear operators Pa and Pa are related by the scaling \a and a similarity transformation. 



Pa = X-a\lfJ-'PaIf^, 
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where Ig, for an arbitrary function g, denotes the kernel Ig{x, •) := g{x)6x{-)- Hence Pa is ^-uniform 
if and only if Pa is (/J^F)-uniform. Result (i) immediately follows. 

Ja 

Consider the unnormalized eigenfunction given in (60). Differentiating the expression /„ = U\^^aSo 
and applying the quotient rule gives, 

fa = £[IK-Pa + So^M~^So 

= ~Ux.AlK-lFPa]Ux.,aSO (63) 
= -UxUiK - \aF)fa] = \aK,aIfSF - A' (a)] . 

The right hand side of (63) lies in since F <E L^o, fa G L^, and U\^^a'L^ is a bounded 

linear operator. This proves the first bound in (iv) since the two versions of are related by a smooth 
normalization. The proof that € M-Y is identical. 

Differentiating both sides of the eigenfunction equation gives 

FXafa + Pafa = Kfa + ^afa- 

Dividing this identity by Xafa shows that (62) does indeed hold. To conclude that Tra{F) = A'(a) we 
will show that 7ra(|Fo|) < oo. The invariant probability measure TVa may be expressed as 

where ka is a normalizing constant. Hence, 

f 

J a 



T^a{Fa)=T^a(\f\)=kaila{\fa\)<<^- 



fa 

Finiteness follows from (iv) and the fact that the eigenmeasure //„ lies in . This proves (ii) and 
the identity in (iii). 

To complete the proof of (iii) we must show that 7r(Fo) = 0. This follows from the normalization 
(48) (assumed to hold for all a) which implies the limits, 

/a ^ 1, i^a^T^, « ^ 0. 

To prove (v) we obtain an alternative expression for Fa- We again consider the unnormalized 
eigenfunction (60). Observe that a fundamental kernel is derived from U\^^a through a scaling and a 
similarity transformation, 

Za = XalfUXa,aIf = [I - Pa + Sa ^ l^al'^ 

with Sa = Xa^fa^so, and Va = ^olf^- We have PaZaG = ZaG — G whenever Tra{G) = 0. 
Using (63) then gives, 

Fa = ^ = Za{F - A'(a)) = Za{F - TTaiF)). 
J a 

It again follows that Fa solves the Poisson equation: It is the unique solution in with fa (-Fa) = 0. 

The desired bound on F^ is obtained as follows. Using Jensen's inequality we know that Vq, = 
log{V/ fa) = log (Fa) solves a version of (V3), 

PaVa <Va-e + bs, 

where £ > and 6 is a finite constant. Using the bound G it follows directly that the function 
Va is uniformly bounded below. The bound on F^ then follows from [23, Theorem 2.3]. □ 
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Proposition 4.10 Suppose that (46) holds. Take P = R, and define the potential operator Uz as in 
(56) with M = s^v. Then Bi := |||C/i|||y < 2bS~^ , and 

jUzllv < Bi{l - \z - l\Bi)-\ \z-l\<B^\ zee. 

Hence both R and P are V -uniform. 

Proof. Under (V4) we have, by (18), 

(e -1){R- I)V = R{P - I)V < -SRV + bRs. 
Rearranging terms then gives 

which we write as 

{R-s^iy)V <V-SiV- u{V)s + biRs , 

where Si = S{e - 1 + S)~^ and bi = b{e - 1 + 
Iterating gives, for all n > 1, 

n—l 



(i? - s (g) < V - 5i^{R - s ^ ufV 

n-l 

+ ^(i? - s (g) ufibiRs - iy{V)s). 



i=0 



Letting n — ^ oo, and applying Lemma 3.2 (ii) yields 

SiUiV <V- u{V) + 2bi < 2biV, 



or WUiWy < 2bi/Si = 2b/S. 

To obtain a bound for z ~ 1 write 



I{z-1) + [I - (R- s0 v) 



Ui{z-1) + I 



Provided |||C/i|||y|z - 1| < 1, we can write = En(l - , and \\Uz\\v < Bi/[l - \z - l\Bi]. □ 

Proposition 4.11 Suppose that (46) holds, let a G R satisfy |a[< | log{l~S)\, and suppose that there 
exists g:X^ (0,oo), satisfying g G and Pag < \a9- Then Pa is V -uniform. 

Proof. The conditions of the proposition imply that there exists 6i < oo such that 

PaV < el"l (1 - 5)V + el"l6.s <V + bis. 
From the resolvent equation (18) we then have, for some 62 < 00, 

ReV <V + biRes, 
where Rg is the resolvent kernel defined through P^. 
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We also have A(a) > for all a / under (46), and hence the g.p.e. 7^1 for Rq is also strictly 
greater than one when 9 > \a\ > (see (37)). Choosing sq G and i/q € Mp so that > sq <8) I'o, 
we find that 

-rg\Re -so® i^o)V <V-sV + biReso, 
where e = 1 — 7^^ > 0. Exactly as in the proof of Proposition 4.10 we conclude that 

sU^gV < 62 [/-ygiie So < '^hUjgSo, 

where U, = z-^'-^Re - sq (g, uqY . 

From Theorem 3.3 (iii) and the conditions of the proposition we know that / = U^^sq satisfies 
f < eg for some constant c, and hence / G L^. It follows that |||C/7g|||y < 00, from which F-uniformity 
of Rg, and hence of Pa, immediately follow. □ 

Proposition 4.12 Suppose that (46) holds. Then there exists Eq > 0, b < 00 sueh that: 

(i) Son{zeC:\z\>l-eo} = {l}. 

(ii) \l[Iz - (P - 1 (g) 7r)]-i|||y < b when 1^1 > 1 - £o- 

(iii) Pa is V -uniform for all a = a + iiv G C satisfying 

e - 1 



\u}\ < £0 and \a\ < a :- 



2b -5 



Moreover, the associated g.p. e. Xa is an analytic function of a in this range, and so is the 
corresponding eigenfunction fa{x) (for each fixed x eX). 

(iv) The eigenf unctions fa are (uniformly) bounded from below when a is real: 

inf /„(x)>0, xeX. 

—a<a<a 

Proof. Results (i) and (ii) follow immediately from Proposition 4.10. To prove (iii) we must 
establish an appropriate range of real a for which Pa is ^-uniform. From Proposition 4.10 we know 
that P = Pq is F-uniform. 

For any function Go G Loo, set go = exp(Go), and consider the kernel Ig^R, where, as before, Ig^ 
denotes the kernel Ig^ix, •) := goix)5x{-)- We assume that the convergence parameter for this kernel 
is equal to one. It then follows from Proposition 4.10 that the function below lies in provided 
\\go\\^^>l-B^\ 

00 

9rix) = ^[IgoiR-S<gu)flg,S, 
k=0 

and it is clear that Ig^R is in fact F-uniform in this case. Applying Lemma 3.2 (i) we know that 

i^{gr) = 1. 

The function gr solves the eigenfunction equation, 

[/go (i? - S (g) u)]gr = gr- IgoS 
=^ Rgr = Qo^Qr- 
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Setting g = Rc/j. = grQ^ ^ and applying the resolvent equation (18) then gives 

{P-I)g = {P-I)Rgr = {e-l){R-I)gr = {e - l)g - {e - l)gog. 

Hence g is the solution to the multiplicative Poisson equation for the function G = log{g) := — log(e — 
(e — 1)50) • The map go ^ g is one to one. 

We have already remarked that gr G provided || (70 1 1 00^ > 1 — -^f^' hence and g G 
whenever go satisfies this bound. If Bi < e, then this constraint is trivially satisfied. For Bi > e, 
equivalently the function g must satisfy, 

Prom the inequality log(l + x) < x, x ^ 0, we obtain 

where the last inequality uses the bound Bi < 2b/ 5. 

This gives the sufficient condition, ||G||oo ^ ^- Proposition 4.11 implies that IgP is ^-uniform, 
and g G when this uniform bound holds. 

The function G falls outside of the class of functions F satisfying (46), since A(a) > for all a 7^ 
when 7r(-P) = 0, and we have already noted that the spectral radius $^{g) of IgP is equal to 1. However, 
given any a, the function G = aF — A(a) satisfies ^(5) = 1 and G{x) < \a\ — A{a) < \a\, x £ X, so 
that the normalized function satisfies (64) when \a\ <a. This transformation immediately gives the 
desired conclusion in (iii). 

To see (iv), take any A satisfying A > max(Aa, A_a), and set 

00 

Ga=Y.X~''~'P2. 



By irreducibility we can find so:X — > (0,1) and a probability distribution i^q on B satisfying the 
uniform bound, 

Gaix,A) > Reix,A) > so{x)uoiA), x G X, A£ B, a G [-a, a] , 

where 9 = a + log (A). We may assume that vo is equivalent to the irreducibility measure ip. 
It follows that for all a G [—a, a] and all x, 

(A - Xar'faix) = Gafa {x) > So{x)uo{fa) > 0. 

By continuity of fa we obtain the desired uniform bound. □ 

We now develop the consequences of the lattice condition. Our main conclusion is contained in 
Theorem 4.13: The function F is almost-lattice if and only if the spectral radius ^i^^, attains its upper 
bound (i.e. ^i^j = 1) for some u > 0. 

Some of the spectral theory for complex a is most easily developed in a Hilbert space setting. 
Define L2 := {/:X C such that H/Hl = 7r(|/p) < 00}, with the natural associated inner product, 
{h,g) = Tr{h*g), h,g& L2. We note that V € L2 under our standing assumption (46). For any n, the 
induced operator norm of P^: L2 L2 may be expressed. 



II P^nll f ^ 

\lPa\l2 = sup = sup{|E,[r($(0))exp(aS„)5($(n))]| : \\h\\2 < 1, II5II2 < l} 
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We let 7a denote the L2-spectral radius, 

7.:= lim ira^/". 
n— »oo 

When a = iu, the hnear operators {P-^,} are contractions on L2, so that 7^0, < 1. 

Theorem 4.13 provides several characterizations of the almost-lattice condition. It is analogous to 
the variance characterization given in Proposition 2.4. 

Theorem 4.13 (Characterization of Lattice Condition) The following are equivalent under (46), for 
any given lo > 0, —a < a <a: 

(i) L = 1; 

(ii) 7iuj = 1; 

(iii) ia+iu; = L; 

(iv) There exists a hounded function 0:X — > [0, 27r) and do > such that for a.e. x G X [^\, 

exp(^iuj J (F($(s))-(io)(^s) =exp(^ie($(t))-iG(^>(0))) , a.s. [P^]. (65) 

(v) F is an almost-lattice function whose span is an integer multiple o/27r/a;. 

Proof. We first note that by Proposition 2.4 the existence of Q,uj,dQ satisfying (iv) is equivalent 
to the almost-lattice condition (v). To prove the proposition it remains to show that (i)-(iv) are 
equivalent. 

The implications (iv) =^ (i), (ii), (iii) are obvious since, under (iv), we have for all n > 1, 

P:+^U^, ■ ) = e-'^«/.eC(^, • Kd^ for a.e. xeX[i^]. 

We now establish implication (i) =^ (ii). We first note that if ^j^^ = 1 then, from the fact that the 
y-norm is submultiplicative (as it is an operator norm), we must have |||-F'j^|||v' ^ 1 all n. Note also 
that for any g G L^, 

\PZ^"'9{x)\ = |E,[exp(a5„)E.j(„)[exp(a5„)5(<&(m))]]| 

< E^[|E$(„)[exp(a5'm)5r($(m))]|] 

< /|E2,[exp(a5^)5($(m))]|7r(dy) +O(F(x)e-''0"), n,m>l, xGX, 
where 60 > exists by F-uniformity of P. This implies the bound, 

l=C'<liminf(sup||E^[r($(0))exp(a5^)5($(m))]| : ||/i||oo < 1, llslk < l|) • (66) 

We have already remarked that Pa is a contraction on L2. It follows that either |||-P(^|||2 
geometrically fast, or 11^^12 = 1 for all n. We may conclude the latter using (66), and this establishes 
the implication (i) ^ (ii). 

We now show that (ii) implies (iv). The supremum in the definition of |||-Pj^|||2 is attained since 
= 1. To see this, construct for any N > 1 functions ,g^ with L2-norm equal to one, with 

1 > \\h''h\\PZg''h > {h'',PZg'') > \IPSI\2 - i/iv = 1 - i/N. 



29 



Part of the construction ensures that the inner product above is real- valued. These bounds imply that 
\\h^ — Pi^g'^\\2 — 0, iV — ^ oo, which is equivalently expressed, 

E.[|(/i^($(0)))* cxp{iuSn)g^mn))] - 1|] ^ 0, oo. 

It then follows that exp(ia;5„) G (t(<I>(0), <I>(n)), and that there exist hn,gn £ L2 such that 

/i;(^>(0))exp(io;5„)ff„($(n)) = 1 a.s. [P,] . (67) 

Wc may assume without loss of generality that |gi„(a;)| = |/i„(.T)| = 1 for all x since | cxp{iujSn)\ = 1- 
Note that (67) is almost the desired conclusion (iv). In particular, on dividing the expressions for 
n and (n + 1) we obtain the suggestive identity. 

To establish (iv) we show that {gn, hn\ may be chosen as follows: {gn\ is independent of n, with 
common value g € L^o-, and we may construct ^0 £ 1^ such that /t„ = e^^°'^g for all n. The required 
function Q in (iii) can then be taken as a version of — \og{g). 

Applying (67) and appealing to stationarity, wc conclude that for any n,m> 1, 

K+m{^i^))^^P{i^Sn+m)gn+mi^{n + m)) = 1 a.S. [P^], 

and 

h*^{^{n)) exp {(za; Efeir' i^($(A:)))5m($(n + m))} = ^^[hUm exp(ia;5^)5^($(m))] 

= 1 a.s. [P^], 

where 7?" denotes the n-fold shift operator on the sample space. 
Combining (67) with these two identities then gives, 

^:+m(*(0))/in($(0))5:($(n))/i„.($(n))5:,($(n + m))5n+m($(n + m)) = 1. 

On taking conditional expectations with respect to $(0) = x we see that for a.e. x eX [ip], 

K+m{^)hn{x) = E,[5n(^(n))/i;,($(n))5„^(^(n + m))5;+„(^(n + m))] + 0{V{x)e-''°^) 

= 7r{gnh*j7rigmg*n+m) + 0{V {x)e-'''^^ + e-^o"^), n, m > 1. 

Since |5n(a^)| = |^n(a^)| = 1 for all x we conclude from Jensen's inequality that for all n. A; > 1, 

9nix)K+k{x) = Tr{g*hn+k) + ei{x) 

hl^^{x)hn{x) = 7r(/i;+j./in) + 62(2;) 

where |ei(a;)| + |e2(x)| = 0{V{x)e~^°'^). This, combined with (68), shows that the desired expression 
can be obtained as an approximation: For any e > we can find a function G (of the form — \og{gn) 
for large n) and ^0 £ such that for a.e. $(0) = x G X, 

exp(za;F($(0)) - exp(^z(0o + ©(^(1)) - 6(^(0)))) < eV{x) a.s. [P^]. 

This easily gives (iv) . 

Finally we show that (iii) implies (iv). Observe first that we have already established the equiv- 
alence of (i) and (iv). Moreover, (iii) is equivalent to the statement (i) for the transition kernel Pa, 
from which we deduce the implication (iii) ^ property (iv) for the Markov chain with transition law 
Pa. This is equivalent to (iv) for the original Markov chain. □ 
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4.2 Continuous Time 

Wc now translate the definitions and results of the previous section to the continuous-time case. 
Suppose that {P* : t G M+} is a semigroup of operators on L^, with generator A, and with finite 
spectral radius given by 

|:= lim |||P*|||^/*. 

t— »oo 

[Note that the definition of the generator of a positive semigroup given in (32) , immediately generalizes 
to general (not necessarily positive) semigroups.] 

Consider the eigenvector equation Ah = Ah. The functions h we consider will always be of the 
form h = Reho, usually with Hq > 0, where Rq is defined as in (18). When all the integrals are 
well-defined we have the resolvent equation (33), so that 

Ah = e{Re - I)ho. 

This identity allows us to lift all of the previous results to the continuous-time setting. In particular, 
under (V4), Fatou's lemma implies that the resolvent R = R\ satisfies, 

RV <{l- 5i)V + biRs, 

with Si = 5(1 + 5)-\ and h = 6(1 + . It then follows as in the discrete time case that 

\l[I-R + s^u]-%<2h/Si. 

Recall the definition of the semigroup {P^ : t G M+} from (31), where we now allow a to be 
possibly complex. The next lemma offers an expression for the generator of this semigroup, analogous 
to the classical Feynman-Kac formula for diffusions. The result is easy to check via the martingale 
representation (22). 

Lemma 4.14 (Feynman-Kac Formula) The generator Aa of the semigroup {P^ : t G M+} satisfies, 

Aa = A + aF, (69) 

where A is the generator o/{P*}. 

Although none of the generators we consider are linear operators on L^, we may still define the 
spectrum of A, S{A) C C, as the set of z G C such that the inverse [Iz — .4]"^ does not exist as a 
bounded linear operator. We have the generalized resolvent equation, 

z[Iz-A]''^=R,= [ ze-^^PUt, zeC, 

J[0,oo) 

where the integral converges in norm for z ^ S{A) such that |e^| > The generator A is called V- 
uniform if it admits a spectral gap and there is a unique pole Aq G S{A) of multiplicity one, satisfying 
|Ao| = exp(Ao) = i. 

Proposition 4.15 // {P* : t G M+} has finite spectral radius |, then: 
(i) The following statements are equivalent: 

(a) The generator A has eigenvalue Aq G C and associated eigenfunction f G . 
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(b) For 6 > ^, the resolvent Rq has eigenvalue \g = 9~^h.o — 1, and eigenfunction f . 
(ii) The following statements are equivalent: 

(a) A is V -uniform. 

(b) Re is V-uniform. 

The proof of Proposition 4.15 is obvious from (34). 

Using these identities, the following results may be proven as in Theorem 4.1 and Theorem 4.2. 
The definition of a is given in (49) . 

Theorem 4.16 (Multiplicative Mean Ergodic Theorem) Suppose that the Markov process * = {$(i) : 
t G M+} and the functional F satisfy (46), and write a = i ) as before. Then there exists 



uj > 0,So > 0, such that for any a = a + iu> E C with \a\ < a, \lo\ < to, there exists A(a) G which is 
maximal and isolated: 

Re (A(q;)) = max{Re (A) : A G 5^} and 5^ n : Re (z) < Re (A(q;)) - So} = A{a). 

Moreover, for any such a, there exist fa G and fia £ MX > satisfying (48), and 

(i) For allx eX, AeB,te R+, 

P'afaix) = XUaix); 

The function fa is also an eigenfunction for Aa '■ 

Aafa = Ha)fa- 

(ii) There exist bo > 0, Bq < oo, such that for all x G X, t > 0, 

E^[eM(^St-tA{a))gmt))]-Qag{x)\ < Bo\\g\\ve-^°'V{x) 
E^[exp(a5t-tA(a))] -/a(x)| < Bo\a\e-'"''V{x). 

Theorem 4.17 (Bounds Around the zw-Axis) Assume that the Markov process ^ = {^{t) : t G M+} 
and the functional F satisfy (46). 

(NL) Suppose that F is strongly non-lattice. For any Q < uo < oji < oo, there exist bo > 0, Bo < oo 
(possibly different than above), such that 



E^[exp(a5t - tA{a))] < SoF(x)e-^o* 
for all a = a -\- iu with \a\ < a and < \uj\ < 



X eX, t>o, 



(70) 



(L) Suppose that F is almost-lattice with span h > 0. For any e > 0, there exist bo > 0, Bq < oo 
(possibly different than above), such that (70) holds for all a = a -\- ico with \a\ < a and e < < 
27r//i-e. 
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5 Edgeworth Expansions for the CLT 



Here we show how the multiphcative mean ergodic theorems of the previous section can be used to 
obtain Edgeworth expansions for the central hmit theorem (CLT) satisfied by the partial sums of a 
geometrically ergodic Markov chain; see, e.g., [37, Ch. 17] for the standard CLT. 

Throughout this section we consider a discrete-time Markov chain * and a bounded functional 
F : X ^ M. Recall our standing assumptions (46) about $ and F. To avoid repetitions later on, we 
collect below a number of properties that will be used repeatedly in the proofs of the results in this 
and the following section. They are proved in the Appendix. 

Properties. Assume that the discrete-time Markov chain $ and the function F : X ^ M satisfy 
(46), and let Sn denote the partial sums as before. Choose and fix an arbitrary x € X, and let 

m„(a) := Ej;[exp(Q;S'„)] , n > 1, a G C. (71) 

PI. There is a sequence {e„} such that 

mn{a) = exp(nA(a))[/Q,(a;) -|- |a|e„] , n > 1 , 

and |e„| — exponentially fast as n — oo, uniformly over all a G O (with CI as in Theorem 4.1). 

P2. If F is strongly non-lattice, then for any < ojo < wi < oo there is a sequence {e^} such that 

^n(a) = exp(raA(a))e^ , n> 1 , 

and |e^| — > exponentially fast as n — ^ oo, uniformly over all a = a -|- za; with \a\ < a and 
^0 < I'^l < '^^i (with a as in Theorem 4.1). 

P3. If F is lattice (or almost lattice) with span h> 0, then for any e > 0, as n — oo, 

sup \mn{iio)\ exponentially fast. 

e<\ui\<2n/h~e 

P4. The function A(-) is analytic in n with A(0) = A'(0) = 0, and A"(0) = > 0. Moreover, 
cr^ := A" (a) is strictly positive for all real a G [—a, a]. 

P5. The third derivative ps := A'"(0) can be expressed as 

oo 

P3 = E,[f3($(0))]+3 ^ E,[F2(cI>(0))F($(z))] 

i=—oo 

OO 

+6 E,[F(#(0))F(#(i))F(#(i +i))]. 

P6. Let F be the solution of the Poisson equation given by (27), and write 

A„ := E,[Sn]-F{x). 
Then |A„| — exponentially fast as n — oo. 
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P7. The eigenfunction is analytic in a G Q, it satisfies fa\a=o = 1) it is strictly positive for 
real a. Moreover, there is some lJq € (0;^^] (depending on x), such that 

5{iu;) := | log/ii^(x) — iLuF(x)\ < (Const)u;^, 

for all < aJo, where F is as in P6. 

The following two results generalize those in [41, 30, 8]. 

Theorem 5.1 (Edgeworth Expansion for Non-Lattice Functionals) Suppose that * and the strongly- 
non-lattice functional F satisfy assumption (4-6), and let Gn{y) denote the distribution function of the 
normalized partial sums Sn/(^\/n: 

G„(y):=P,|^ <y|, y G R. 

Then, for all x G X, 

Gn{y) = Q{y) + ^\^{l-y^) - F{x)\+o{n-"^), n ^ oo, (72) 

uniformly in y £ where 7(7/) denotes the standard Normal density and G{y) is the corresponding 
distribution function. 

It is perhaps worth noting the way in which the convergence in (72) depends on the initial state 
X of the Markov chain: This dependence is only manifested via the solution F{x) to the Poisson 
equation. Also observe that, since (72) holds for all y G M, the restriction on F being \F\ < 1 can 
clearly be relaxed to ||-F||oo < 00. 

For the proof of the theorem - given in the Appendix - it is convenient to consider the zero-mean 
version of the normalized partial sums, 



Let Gn{y) denote the corresponding distribution function. In the proof we show instead that 

Gniy) = g{y) + -^{l-y'h{y) + o{n-'/% n - 00, (73) 

uniformly in y G M. Prom this it is a straightforward calculation to deduce (72) via a Taylor series 
expansion and using property P6. 

Before stating our next result we recall the following notation. If G is the distribution function of 
a lattice random variable with values on the lattice {d + kh, G Z}, the polygonal approximation G# 
to G is the piecewise-linear distribution function G^{y) that agrees with G{y) at the mid-points of 
the lattice, y = d + (k + l/2)h, A; G Z, and is linearly interpolated between these points. The function 

is precisely the convolution of G with the uniform distribution on [—h/2, h/2]. 

Theorem 5.2 (Edgeworth Expansion for Lattice Functionals) Suppose that F is a lattice functional 
with span h > 0, and assume that F and $ satisfy assumption (46)- With Gn{y) as in Theorem 5.1, 
let Gn {y) denote its polygonal approximation. Then, for all x eX, 



G*{y) = G{y) + ^ ^{l - y^) - F(x) + o{n-^l% n^oo, (74) 



P3 



6a2 
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uniformly in y G R. In particular, writing hn = hja^pn, (74) holds with Gn{y) in place of Gn{y) at 

jj 

the points {y = {k + l/2)hn, k G Z}, and with [Gn{y) + G„(j/— )]/2 in place of Gn{y) at the points 
{y = khn, k G Z}. 

The proof is given in the Appendix. As with Theorem 5.1, it is more convenient to prove a version 
of (74) in terms of G„ (y) rather than Gn{y), where G„ is the polygonal approximation to G„. In the 
proof we show that 

Gt{y) = G{y) + :^^{l-y^)l{y) + o{n-''% n^oo, (75) 

uniformly in y G M. Then (74) follows from (75) in the same way that (72) follows from (73). 

Before moving on to large deviations we note that, although we shall not pursue these directions 
further in this paper, using the multiplicative mean ergodic theorems of Section 4 it is possible to 
prove higher-order Edgeworth expansions, as well as precise local limit theorems for the density (or 
the pseudo-density, when a density does not exist) of Sn- The Edgeworth-expansion proofs follow the 
same outline as those in the case of independent random variables; cf. [17, p. 541]. For the local limit 
theorems, one can apply directly the general results of [7, Sec. 2]. 



6 Moderate and Large Deviations 

In this section we use the multiplicative mean ergodic theorems of Theorem 4.1 and Theorem 4.2 to 
prove moderate and large deviations results for the partial sums of a Markov chain. As in Section 5, we 
consider the partial sums {Sn} of a bounded functional F of the discrete-time, geometrically ergodic 
Markov chain 

First we note that the multiplicative mean ergodic theorem together with the analyticity of A{a) 
in a neighborhood of the origin (see properties PI and P4 in the previous section) immediately imply 
that the partial sums Sn satisfy a moderate deviations principle (MDP). We state this MDP, without 
proof, in Proposition 6.1. Its proof is based on an application of the Gartner-Ellis theorem, exactly 
as in the proof of Theorem 3.7.1 in [13]. 

Proposition 6.1 (Moderate Deviations) [10, 11] Suppose the Markov chain $ and the functional F 
satisfy (46), and let {6„} he a sequence of constants such that 

bn 1 bn ^ 

—p= — ^ oo and ^ 0, n — ^ oo. 

s/n n 

Then, for all x & X and any measurable B cM., 

- inf (^) < lim inf J—logpJ^eB] 

< lim sup -2^— log R, G S j < - inf ( , 

n^oo bi/n [bn J y^B\2a^J 

where B° denotes the interior of B and B denotes its closure. 

Note that the same result holds for the centered random variables [Sn — ^x{Sn}]/bn in place of 

Sn/bn- 
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6.1 Large Deviations for Doeblin Chains 

Suppose that $ is a Doeblin recurrent chain, that is, suppose that for some m > 1, e' > 0, and a 
probabihty measure v' , we have that P"* > e'v' . Equivalently, the Doebhn condition can be stated as 

R> eu , for some e > and a probabihty measure u , (76) 

and this, in turn can be seen to be equivalent to geometric ergodicity with a bounded Lyapunov 
function V in (V4); see [37, Theorem 16.0.2]. Then the state space X is small, and the results of [43] 
can be applied to get large deviations results for the partial sums Sn- For example, for a Doeblin 
chain with a countable state space X and with '0=counting measure, the partial sums Sn satisfy a 
large deviations principle (LDP) under the distributions Rj, for any x G X. 

But the situation is more complicated when $ is stationary, i.e., when $(0) ~ vr. In the following 
proposition we consider the LDP for the partial sums Sn under the stationary distribution Ptt. 

Proposition 6.2 (Large Deviations) Suppose the Doeblin chain ^ and the functional F satisfy (46), 
and let a = {^^)e, where e is as in (76). 

(i) The partial sums Sn satisfy an LDP in a neighborhood of the origin: For any c G (0, A'(a)) and 

any d G (A'(— a),0), we have 

hm ilogP^{S„>nc} = -A*(c) 
lim ^logP^{Sn <nc'} = -A*(c'), 

n— »oo 

where 

A*(c):= sup [ac-A(a)]. 

—a<a<a 

(ii) Part (i) cannot in general be extended to a full LDP on the whole real line. 

Proof. Integrating the multiplicative mean ergodic theorem in (50) with respect to tt and noting 
that 7r(/a) € (0, oo) for all |a| < a, we get that 

^ log E^[exp(aS'„)] A(a), n ^ oo, 

for all real a G [—a, a]. Since A(a) is analytic, (i) follows from the Gartner-Ellis theorem [13, Theo- 
rem 2.3.6]. To see that in the Doeblin case a = {^E^)e, note that in (V4) we can set F = 1, s = e, 
take < (5 < 1 be arbitrary, and define b = S/e. We then have a version of (V4), 

PV = V = {1-5)V + bs. 

Using the definition of a given in Theorem 4.1 then gives. 

Part (ii) follows from the counter-example in Proposition 5 of [5]. □ 
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6.2 Exact Large Deviations for Geometrically Ergodic Chains 



Next we consider the more general case of geometrically ergodic Markov chains, satisfying our standing 
assumptions (46). With a as in Theorem 4.1, let {A\A) denote the interval 

{A' ,A) := {A'(a) : -a < a < a}, 

and note that = 7r(F) = A'(0) G {A', A). Recall the definition of A*(c) in Proposition 6.2. 

Theorem 6.3 (Exact Large Deviations for Non-Lattice Functionals) Suppose that $ and the strongly- 
non-lattice functional F satisfy (46), and let c G (0, A). Then, for all x eX, 

P.{Sn>nc} ^ fo^ ^-nK-ic)^ ^^^^ 

a-sj Zima- 

where a is chosen so that A' (a) = c, and Ua '■= A" (a). A corresponding result holds for the lower tail. 

It is perhaps worth pointing out that the way in which the large deviations probabilities Px{Sn > 
nc} depend on the initial state x of the Markov chain is via the solution fa{x) to the multiplicative 
Poisson equation. 

Although the proof (given next) relies on an application of a general result from [7] , the main idea 
is similar to the proof of the corresponding result for independent random variables [1]: First, as in the 
case of finite state space [39], we perform a change of measure that maps the transition kernel P to the 
twisted kernel Pa- Since $ is geometrically ergodic, by Proposition 4.12 Pa is l/-uniform. Therefore 
Pa is 14-uniform by Proposition 4.9, and hence it is geometrically ergodic by Corollary 4.7. Therefore 
we can apply the Edgeworth expansions of Section 5, and complete the proof along the lines of the 
corresponding argument in the case of independent random variables; see, e.g., [13, Theorem 3.7.4]. 



Proof. Choose and fix an arbitrary x G X. The result of the theorem will follow by an application 
of [7, Theorem 3.3]. We consider the moment generating functions m„(a) of 5^, defined in (71) for a in 
the interior of the compact set J7 in Theorem 4.1. [Note that, although our J7 is different from the open 
disc used in [7], a close examination of the proof of [7, Theorem 3.3] shows that the result continues 
to hold when the open disc of radius a is replaced with the interior {a = a + iuj : \a\ < a, \uj\ < lo} of 
the strip as long as cJ > 0.] 

We will make repeated use of the properties PI - P7 stated in Section 5. Prom the definition of 
mnipi) it is easily seen that it is an analytic function of a, and from PI and P4 it follows that m„(Q;) 
is nonzero on 17, for all n large enough (uniformly in a). 

Let A„(a) be the normalized log-moment generating function 

A„(q;) := -logm„(Q;) , a G J7 , 
n 

and 

A* (c) := sup [ac - A„(a)] , c G M. 

— a<a<a 

The main step in the proof is the verification of the assumptions of [7, Theorem 3.3]. Most of them, 
plus some other technical properties, are established in the following lemma (proved in the Appendix). 
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Lemma 6.4 Under the assumptions of the theorem: 

(i) For n large enough there is a unique an G (0, a) such that AJ^(a„) = c and A* (c) = a„c — A„(a„). 

(ii) Similarly, there is a unique a G (0, a) such that A' (a) = c and A*(c) = ac — A(a). 

(iii) an ^ a as n ^ oo, and, in fact, an — a = 0{^). 

(iv) A'nittn) crl, asn^oo. 

(v) A* (c) ^ A*(c) as n oo, and, m /act, 

A;(c) = A*(c)-ilog/„(a;) + o(i). 

The theorem foUows from [7, Theorem 3.3], upon verifying condition (c) of [7, p. 1685]. For that, 
it suffices to show that for all < loq < loi < oo, 



sup 

uJ()<|aj|<tt)i 



m„(a + iu>) 



mn{a') 



uniformly in a' in a ncighborliood of a. But the above convergence actually takes place exponentially 
fast, as can be easily verified using properties PI and P2 from Section 5. □ 

Theorem 6.5 (Exact Large Deviations for Lattice Functionals) Suppose that $ and the lattice func- 
tional F satisfy (4-6), and assume that F has span h > 0. Let {c^} be a sequence of real numbers in 
(e,^ — e), for some e > 0, and assume (without loss of generality) that, for each n, Cn is in the support 
of Sn- Then, for all x G X, 

PJSn>nCn} ^, e-"^"^'^"), n-^oo, (77) 

where each an G (0,a) is chosen so that A^(a„) = Cn- A corresponding result holds for the lower tail. 

Note that in the lattice case we have given a slightly more general version of the result given in 
Theorem 6.3. If it turns out to be the case that the Cn converge to some c G (e,^ — e), so that the 
corresponding a„ converge to some a G (0,a) at a rate 0(l/n), then applying Lemma 6.4 as before, 
from (77) we obtain, 

Px{Sn > nCn} '}M^)^^^-~nA*ic) 

^ (l-e-'*«)V2WI 

where = A" (a). 

Proof. Choose and fix an arbitrary a; G X. The proof parallels that of Theorem 6.3, relying on 
an application of Theorem 3.5 from [7]. A close examination of its proof in [7] shows that, as in the 

case of Theorem 3.3 above, Theorem 3.5 remains valid if we replace the open disc of radius a by the 
interior of the strip $7. Proceeding as in the proof of Theorem 6.3, we now need to verify condition 
(c') on [7, p. 1686]. For that, it suffices to show that for that for all ujo G (0, 7r//i), 

runia' + iu) 



sup 

LUo<\Lj\<TT/h 



mn{a') 



uniformly in a' G {e,A — e). Using properties PI and P3 from Section 5, it is easy to see that the 
above convergence actually takes place exponentially fast, and this completes the proof. □ 
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7 Examples 



7.1 Countable State Space Models 

Let ^ be a discrete-time Markov chain with a countable set X of states, and let ip be counting measure. 
Suppose $ is irreducible in the usual sense that R[x, y) > for all x, y € X. Then Ig is a small function 
for any 9 & X, with associated small measure = P{6, ■)■ Using this small function and measure in 
Lemma 3.2 (i) leads to the following characterization of A(a) for real a, 

Tg-l 



A(a) = inf{A: Ee[exp(^[aF($fc)-A])] < l} ; (78) 



k=0 

see [2] for details. When the infimum is attained and we may justify differentiation with respect to a, 
then 

l = Ee[cxp(^[aF($fe)-A(a)])] =^ = E, - A'(0)]" . 

fe=0 fc=0 

This gives a more transparent proof of the identity A'(0) = 7r(F). 

The simple queue. For our purposes, the simplest interesting example of a countable state 
space chain is the M/M/1 queue. This is the reflected random walk * on X = {0, 1,2,.. .}, with 

P(x, X +1) = p, P{x, {x — 1)+) = q, X G X, 

where p + q = 1. We assume that p = p/q < 1 so that the chain is positive recurrent. As we show 
next: 

(a) * is geometrically ergodic; 

(b) it is not Doeblin recurrent; 

(c) with F = Iqc — 7r(0'^), the multiplicative mean ergodic theorem (50) does not hold for all real a. 

It is also not hard to show that $ does not satisfy (mV3) for any / with finite sublevel sets, so 
that, in view of the discussion in Section 2.2, the Donsker-Varadhan conditions do not apply. More 
importantly, as Wu recently showed, not just the conditions, but also the large deviations conclusions 
of the Donsker-Varadhan theory fail in this case [56]. Therefore, this example does not fall under any 
of the standard conditions known to imply large deviations results. 

Below we also show that our central technical result, the multiplicative mean ergodic theorem (50), 
cannot in general be extended to hold on the entire real line. 

First note that one can compute directly the expectations, 

< oo, 0<r<P , . 

= oo, r>P ' ^ ^ 

where (3 = {4qp)~i > 1. To construct a Lyapunov function, consider V{x) = Tq, for ro > 1: 

PV(x) = / (P'-o + qro')V{x) x > 1 

I {pro + q)V{0) x = 0. ^ ' 
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Figure 1: The soHd-curve shows the log-moment generating function A(a), a G M, for the M/M/1 
queue with F = 1qc — 7r(0'^). It is strictly convex for a < a*, and it is linear for a> a* . 

Choosing a minimal value for (1 — 5) := (pro + QVq^) gives ro = and a solution to (V4): 

PV{x) = ^/^V{x) = '^'W{x), X>1. 

It easily follows that, with r = jS, E^[r'^"] = V{x) = p^^^^, a; > 1. This gives the finite bound in (79), 
and shows that $ is geometrically ergodic with Lyapunov function V . 

The easiest way to see that $ is not Doeblin recurrent is to notice that, in k time steps, $ cannot 
visit more than its 2k neighboring states, which implies that the state space is not small; see [37, 
Theorem 16.0.2]. 

Now let F = Iqc — 7r(0^). Using the characterization (78) with 6 = 0, we find that A(a) is the 
unique solution to the fixed-point equation, 

Eo[exp{(7roa - A(a))To}] = e" , 

where ttq := 7r(0). It follows from (79) that exp{7roa— A(a)} < (3 for all a G M. Also, from the fixed-point 
equation it follows that if 0* :=logEo[/3 '^], then A(a*) = vroo* — log /3. But since exp{7roa*—A(a*)} = (3, 
and (vToa — A(a)) is nondccrcasing in a, from (79) wc conclude that A(a) = a — log/3 for all a > a*, 
and hence A" (a) = for a > a*. [To see that (TToa — A(a)) is nonincreasing, simply recall from 
Proposition 4.9 that A' (a) = 7r(a) so that A' (a) < sup^, F(x) = ttq.] But as we saw in property P4, 
the multiplicative mean ergodic theorem (50) implies that A" (a) > for all a for which it is valid, 
therefore it cannot be valid for real a > a* . 

A plot of A(a) for F = Iqc — 7r(0^) is shown in Figure 1. 

7.2 Diffusions 

Consider an elliptic diffusion on a manifold X. We assume that * is non-explosive, so that the sample 
paths are continuous on [0, oo) with probability one. It is then strong Feller and ■^-irreducible, where 

ip is Lebesgue measure on X (see e.g. [47]), and compact subsets of X are small. 
Consider the special case where X = M" and the diffusion term is constant, 

where A denotes the Laplacian. If /„ G solves the multiplicative Poisson equation for some a G M, 
we may consider the twisted process that is, the Markov process with transition semigroup defined 
as before, 

P'a{x,dy) = \-'f-\x)Pi{x,dy)Uy) , t>0. 
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If (69) holds, then the generator Aa of $a is given by 

Aa = (/i + V^cT^F^) • + ia^A^, 

(81) 

where Fa = log{fa)- Note that the twisted process has the same diffusion term as the original - only 
the drift is affected by the twisting. 

Reflected Brownian motion. Diffusions with reflection are currently a popular model in the 
operations-research area. Consider for example a two-dimensional reflected Brownian motion (RBM) 
* on X = Rl, with normal reflection on each boundary. We show below that (when the drift is 
negative) $ is geometrically ergodic. But it is not Docblin recurrent, and it does not satisfy (mV3) 
for any / with compact sublevel sets (for the same reasons as in the reflected random walk example 
above) . 

Within the interior of X, the sample paths are identical to those of the affine stochastic differential 
equation (SDE) model, 

d<^>i = -5idt + dWi, i = l,2, 

where W = {Wi,W2) is a standard Brownian motion (BM) on M^, and the drift term di is positive 
for each i. A characterization of the generator can be found in [55]. 

Suppose that V-.M.^ — R+ is smooth, and suppose that the following boundary conditions are 
satisfied, 

d d d d 

^V<—V, XI = 0; v>—V, X2 = 0. 82 

Then, with L = —5 + | A, the process below is a supermartingale, 



m{t) = V{^{t)) - [ LV ($(s)) ds. 

J[o,t) 



A candidate Lyapunov function is the quadratic, Vo{x) = since LVq = —26 ■ x + l is negative 

for large x, and the boundary conditions (82) are satisfied. The supermartingale property implies that 



[Vomt))]<-5o [ E[^i{s) + ^2{s)]ds + t, 

J\0,t) 



" '[0,t) 

where 6o = 2 min((5i, (52). This may be seen as a generalization of (V3), with f{x) equal to a norm on 
M^. To obtain a version of (V4), first consider Vi = \'Vo. We have, for some di > 0, and some B < oo, 

Ex[Vi{<^{t))] < -Sit, 0<t<l, \\x\\ > B. 

Finally, setting V{x) = exp{pVi{x)), we can find /3 > sufficiently small such that 

E^[V{^{t))] < exp(-/3t)y(x), < t < 1, ||ar|| > B. 

We conclude that the RBM is geometrically ergodic, provided the reflection is normal and the drift 
is negative. Therefore, for any bounded functional F, from Theorems 6.3 and 6.5 we get precise large 
deviations bounds for the time-averages {St}, at least in some interval around the mean 7r(F) of F. 
Moreover, in view recent results in [6] (where a detailed study of large deviations properties of RBM 
(in one dimension) is performed), we should not expect the limit theorems 6.3 and 6.5 to hold on the 
whole real line. 
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In general, geometric ergodicity depends upon the interaction of the drift vector and the reflection 
vectors along the boundaries. In multidimensional models it is not always obvious how to choose an 
appropriate Lyapunov function, but one can devise numerical methods to search for a quadratic Vq 
satisfying the required constraints; see [34, 49]. 

Acknowledgments 

Several interesting discussions with Amir Dembo on large deviations properties of Markov chains are 
gratefully acknowledged. 

Appendix 
Proof of P1-P7 

PI, P2, P3 and the analyticity of A{a) follow from the multiplicative mean ergodic theorems in 
Theorem 4.1 and Theorem 4.2. 

To establish P4, first note that A(0) = follows from the uniform convergence in Theorem 4.1. 
Similarly it follows that A'(0) = 7r(F) = and that A"(0) = hm„(l/n)Vara:(5n) = > by 
assumption (46) and Proposition 2.4. 

On considering the kernel Pa for real a G 0,, since A"(0) > 0, A"{a) > for all such a, and 
TTaiF) = A'(a) by Proposition 4.9, it follows that A'(a) = TTa{F) > for all nonzero a € ^2. Now, if 
A"(a) = is zero, then by (30) in Proposition 2.4 it follows that 7r(F — 7ra(F)) = 0. This is impossible 
since Tra{F) > 0. 

The exponential convergence in P6 is given in Theorem 2.5. The analyticity of fa is stated in 
Theorem 4.12, and fa\a=o = 1 by PI. Proposition 4.9 combined with Proposition 4.12 give P7. 

Property P5 requires more work. For a neighborhood O of zero the function given below is a 
constant times the normalized eigenfunction given in (61): 



fa — 



a 



aeO. 



It is the unique solution in satisfying 7r(/a) = 1. Hence, for all k, 




We have a form of the quotient rule. 



fa - -Ha^H'^H^^l, 



and after repeated differentiation we obtain 




d^Ja 



+ 3 H~ ^ H'^ H~ ^ H'^ H~ ^ 1 
+3H~^ H'^H^^ H'^H~^1 
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Evaluating at a = 0, wc have H^^ = Z = [I - P + U]-^ and /x-^a = [-^ - {If)''P], k > I. 

a=0 

Using HZ = n, and Zl = PI = 1 then gives, 



= ^(^/") = Q^IfPZIfPZF 

+SUIfPZ{F^ - a^) 
+3n(/|, - (j'^)PZIf 

The proof is then complete on interpreting these formulae, since A'"(0) = Aq", and 

oo 

PZG {x) = 7r(G) + E.[G($(A:)) - 7r(G)] , 

for any function G G L^. □ 
Proof of Theorem 5.1 

Wc follow closely Feller's argument in the proof of Theorem 1 in [17, p. 539], leading to the statement 
(73). Choose and fix a; G X arbitrary. For ra > 1, define 

M„(q;) := Ej;[exp(Q;S'„)] = m„(Q;) exp(-Q;Ej;{S'„}) , a G C, (83) 

and the distribution functions 

^n{v):=Q{v)--^^^{y''-i)i{y), yeR, (84) 

with corresponding characteristic functions 

M^) ■■= exp(-a;V2) (l + ^^^) , ^ e M. (85) 

Let e > arbitrary. Choose A large enough so that A > 24(e7r)"^|*^(y)| for all y G M, n > 1. 
From Esseen's smoothing lemma given in [17, p. 538], with T = Ay/n we get that. 



\Gn{y)-^n{y)\<- f 



Mn { ^ ) - 0n(a;) 



+ yeR. (86) 



To prove (73) it suffices to show that this integral is o(n~^/^). 

We first consider the integral in the range B\pn, < \uj\ < A^pn^ with < S < min{(TcUo, A} to be 
chosen later (where cJo is as in P7). Applying the change of variables t = oj/{a^/n), this integral is 
bounded above by 

^/ \mniit)\dt + ^ \(pn{cTVnt)\dt. 

The second integrand converges to zero exponentially fast, uniformly over t in that range, and the 
first integrand converges to zero exponentially fast by P2. Therefore, the above expression is certainly 
no larger that o(n~^/^). 
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Next we consider the integral in (86) in the range < B^pn. From the definition of M„ and by 
properties PI and P6, after the change of variables t = ujl{a^/n) this equals 



1 



exp i^-itl^n - itF{xfj exp {nA{it)) [fit{x) + itcn] - (pnita-s/n) 



Expanding A (it) in a Taylor series around zero yields 

R 

1 



dt 



-j^^ exp(-intV2) |exp {-itA„ + ^{itfM"{it) + log{fu + iten) - itF^ - 1 - "^{itf 
for some real s = s{t) with |s| < Bja. Noting that, 

iUr. 

fit 



dt 



log{fit{x) + iten) - itF = 5{it) + log 1 + 



where 6{-) is as in P7, the second exponent in the above integrand can be written as 

^{itfK"'{it) - it An + S{it) + ite'l,{it) 
where el{it) := [log(l + j^)\/{it), and 

kn(^*)l ~^ exponentially fast, n — ^ oo, 
uniformly in \t\ < B/a (by PI). Therefore, the integral we wish to bound is 



(87) 



1 /"^ . 1 
- / exp(-2 



ntV^) |exp {|(it)3A'"(is) - it An + ite'^iit) + 5(it)} - 1 - "^{itf 



dt 



To show that this is o(n ^/^) we will apply the following simple inequality from [17, p. 534], 

|e"-i-/3|<(|a-/3| + l|/3|2)e^ 



(89) 



where 7 > max{|Q;|, First we choose B small enough so that the following four bounds hold for 
all \t\ < B/a, 

(a) |A'"(it)-/93|<6e 

(&) |3lA'"(^0l<^ 

(c) \m\ < \ 

6^^ 4' 

where (o) and (6) are possible by the analyticity of A(-) and the definition of pz in P5, and (c) is 
possible because of P7. Then, writing 

a := '^{itfA'"{is) - it An + it^^{it) + 6{it) 
and P := "^{itf 
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using (5), (c), and (87), we can bound 



|a| < ntV-^ sup \A"\it)\ + \t\[\An\ + Kiit)\] + \d{it)\ 

60- \t\<B/a 

< -nfa'^ + ^ + e/2 
4 n 

< \nt^a^ + e , 
~ 4 

where the last two inequalities are valid after taking n large enough. Similarly, using (a), (6), (87), 
and P7, 

\a-p\ < ^|t|^|A"'(zs)-/93| + ^ + (Const)t2 

\t\ 



for n large enough, and using (d), 



< enltl'^ + — + (Const)r , 
n 



Applying inequality (89) with 7 := jnt a and in conjuction with the last three bounds, the 
integral in (88) is bounded above by 



exp(— 4nt cr + e) 



en\tf + (Const)*^ + M + i !^ 
n 2 V 6 



dt 



and changing variables back to a; = t{a^/n), 



— / e * 



e 



+ (Const^ + + 



< 



' uj^e~i^ dio] + O 



Since e was arbitrary this shows that the integral in (88) is o(n~^/^), and completes the proof. □ 



7.3 Proof of Theorem 5.2 

We follow closely Feller's argument in the proof of Theorem 2 in [17, p. 540]. Choose and fix an 
arbitrary x G X. Let e > arbitrary, and let be the distribution function (84). Recall that 



= Gn* U[-hn/2,hn/2] and G* = * U[-hn/2,hn/2]. Proceeding as in [17, p. 540] along 
equations (4.9) and (4.10) (with in place of "G" and := * U[-hn/2, hn/2] in place of "G#") 
we obtain, after taking ^4 > large enough. 



< 



\oj\ yjn 



yGM, (90) 



where Mn and ^„ are defined in (83) and (85), and 

s„(a;) := 



sin(^/i„u;) 
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To prove (75) it suffices to show that the integral in (90) is o(ra~^/^). We separately consider the 
integral over |a;| < B^/n and over B^/n < \uj\ < A^s/n, for some conveniently chosen B < m.m{auj, A}, 
where uJ is as in P7. Noting that |sn(w)| < 1 for all u, the integral in the former range can be shown 
to be of order o(n~^/^) as in the non-lattice case. 
Therefore, it remains to show that 



B^<\u>\<A^ 



M„, 



< 



B^<\uj\<A^ 



lUJ 



\L0\ 



\UJ\ 



■ du) + 



B^<\w\<A^ 



du = o(n-^/2)_ (91) 



The last integral above is easily seen to decay exponentially in n (as in the proof of Theorem 5.1), 
and hence we concentrate on the former integral, which, after the change of variables t = u)/{ay/n), 
becomes ^ 

- / \mn{it)\\sui{\ht)\ -2". 

(7 — I I — cr 



Notice that | sin(^/it)| and mn{it) are periodic functions of t with period 2ti /h. Consider, without 
loss of generality, the range of t G [B/a,A/(T\ (the case of negative t is similar). Let {k — 1) denote 
the number of full periods of length 27r//i in that interval. Then, since | sin(y)/y| < 1 for all real y, 



h 



I / \ I I • / 1 7 \ I /C(T 

^ \mn{it)\\sm{^ht)\ ■^<^ 1^ 



27T B 

h a 



\mn{it)\dt + 



m„(zt)| \t\ dt , 



where the first integral converges to zero exponentially fast by P3. Using PI to bound m„(it) and 
expanding A{it) in a Taylor series, the second integral is 



C I ^\t\\ex.p{nA{it)}\dt <C' J i+ii„™r i^^2„2i 



\t\ I exp{— int a }\dt 
for some constants C, C and C" . This establishes (91) and completes the proof 



91 

n 



□ 



7.4 Proof of Lemma 6.4 

Part (ii) is immediate by the choice of c and property P4. For part (i) note that by the uniform 
convergence of A„(a) to A(a) (property PI) we also have convergence of their derivatives, so for n 
large enough we can pick a„ as claimed, and since A^(a) eventually will be strictly positive for all 
a G (—a, a), this is unique. 

For part (iii) recall that A^(a„) = c = A'(a), so the fact that a„ — a as n — oo follows by the 
uniform convergence of the functions A^. Moreover, expanding A^(a) around a = a„ and using PI, 



= A'(a)-A:,(a„) 

= [A;(a)-A;(a„)]-[A'(a)-AUa)] 

= [{a-an)K{an) + 0{a-anf\ + ^ 



n 



■ ^og{fa{x) + aen exp{-nA(a)}) 



[{a - an)K{an) + 0{a - anf] + O Q j + 0(e„), 
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where in the last step we used P7. Taking n large enough so that {A^(a„)} is a bounded sequence, 
bounded away from zero from below, this implies that (a„ — a) = 0(l/n). 

Part (iv) is an immediate consequence of P4 and of the uniform convergence in PI. Finally for (v) 
we have from (i), (ii), and PI, 

A*(c) = OnC - A(a„) - ^ log + aen] 

= + {an - a)c+{A{a) - A(a„)) - ^log/a„ - ^ log(l + a„e„//a„) , 

and, using (iii) and P7, 

Kic) = A*(c) + o(i^)-ilog/„ + o(i^)+0(6„), 
as required. □ 
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